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Abstract 

We obtain the Brownian net of Sun and Swart (2008) as the scaling limit of the paths 
traced out by a system of continuous (one-dimensional) space and time branching and co¬ 
alescing random walks. This demonstrates a certain universality of the net, which we have 
not seen explored elsewhere. The walks themselves arise in a natural way as the ancestral 
lineages relating individuals in a sample from a biological population evolving according to 
the spatial Lambda-Fleming-Viot process. Our scaling reveals the effect, in dimension one, of 
spatial structure on the spread of a selectively advantageous gene through such a population. 


1 Introduction 


The Brownian net , introduced by [Sun and Swart (2008), arises as the scaling limit of a system 
of branching and coalescing random walk paths. It extends, in a natural way, the Brownian 
web, which originated in the work of Arratia (1979). In the Brownian web there is no branching. 
It can be thought of as the diffusive limit of a system of one-dimensional coalescing random 
walk paths, one started from each point of the space-time (diamond) lattice. Informally, the 
web is then a system of coalescing Brownian paths, one started from each space-time point. 


Fontes et al. (2004) formulated the Brownian web as a random variable taking its values in the 
space of compact sets of paths, equipped with a topology under which it is a Polish space. In 
this framework, the powerful techniques of weak convergence become available and as a result 
the Brownian web emerges as the limit of a wide variety of one-dimensional coalescing systems; 
e.g. Ferrari et al. (2005), Newman et al. (2005). This points to a certain ‘universality’ of the 
Brownian web. 

In the Brownian net, each path has a small probability (tending to zero in the scaling limit) 
of branching in each time step. The limiting object is (even) more difficult to visualise than 
the Brownian web, as there will be a multitude of paths emanating from each space-time point. 
Nonetheless, Sun and Swart (2008) show that it can be characterised through the systems of 
‘left-most’ and ‘right-most’ paths from each point, each of which itself forms a Brownian web 
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(with drift). Motivated by the study of perturbations of one-dimensional voter models, Newman 


et al.| (2015) show that, by starting from systems of random walks that branch, coalesce and 


die on the diamond lattice, the Brownian net can be extended still further to include a killing 
term. However, we have not seen the ‘universality’ of the Brownian net explored. Our main 
result, Theorem |3.5[ is a contribution in this direction. It establishes an appropriate scaling 
under which the paths traced out by a system of branching and coalescing continuous time and 
space random walks in one spatial dimension converges to the Brownian net. 

The original motivation for our work was a question of interest in population genetics: when 
will the action of natural selection on a gene in a spatially structured population cause a de¬ 
tectable trace in the patterns of genetic variation observed in the contemporary population? We 
deal with the most biologically interesting case of a population evolving in a two-dimensional 
spatial continuum in Etheridge et al. (2016). Our work in this paper uncovers some of the rich 
mathematical structure underlying mathematical models for biological populations evolving in 
one-dimensional spatial continua. In particular, we study the systems of interacting random 
walks that, as dual processes (corresponding to ancestral lineages of the model), describe the 
relationships, across both time and space, between individuals sampled from those populations. 

It is natural to ask whether the model of Newman et al. (2015) has a biological interpretation. 
It does: killing corresponds to a mutation term. This was observed by Sun and Swart (2008) 
(c.f. Etheridge et al. (2015)). However, in view of the technical challenges to be overcome to 
handle the additional killing term, even on a diamond lattice, we do not explore this further 
here. 

Our starting point will be the Spatial A-Fleming-Viot process with selection (SAFVS) which 
(along with its dual) was introduced and constructed in Etheridge et al. (2014). The dynamics 
of both the SAFVS and its dual are driven by a Poisson Point Process of events (which model 
reproduction in the population) and will be described in detail in Section [2] Roughly, each event 
prescribes a region in which reproduction takes place. A proportion v of the population in the 
affected region is replaced by offspring of a single parent. We shall refer to v as the impact of 
the event. In the absence of selection, the dual process of ancestral lineages is a modification of 
the ‘Poisson trees’ of Ferrari et al. (2005). With selection, our dual follows ‘potential’ ancestral 
lineages, which introduces a branching mechanism, with the rate of branching determined by 
the presence of lineages in a region, but not increasing with their density. Our main result, 
Theorem 3.5, is that in one spatial dimension and when the impact v = 1 (which prevents 


ancestral lineages from jumping over one another), when suitably scaled the system of branching 
and coalescing ancestral lineages converges to the Brownian net. 

Without selection, the corresponding objects converge (after scaling) to the Brownian web. 
In that setting, we believe (and Berestycki et al. (2013) provides strong supporting evidence) 
that the random walks can even be allowed to jump over one another and the only effect on the 
limiting object is a simple scaling of time (given by one minus the crossing probability of ‘nearby’ 
paths). This would mirror the results of Newman et al. (2005), in which systems of coalescing 
non-simple random walks with crossing paths are shown to converge to the Brownian web. When 
we try to include selection in this limit, allowing paths to cross has a more complicated effect, 
as we illustrate through simulations in Section [7j It is an intriguing open question to explain 
the pictures that we present there. 

(2014), scaling limits of the SAFVS were considered in which the local 
In that case, the classical Fisher-KPP equation and its 
The dual process of branching and coalescing lineages 


In Etheridge et al. 


population density tends to infinity, 
stochastic analogue can be recovered, 
converges to branching Brownian motion, with coalescence of lineages at a rate determined by 
the local time that they spend together. In this article we are interested in a very different 
regime, in which coalescence of lineages is instantaneous on meeting. 
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Although our result owes a lot to the existing literature, the continuum setting introduces 
some new features. In particular, some care is needed in extending the self-duality of the systems 


of branching and coalescing simple random walks that appear in Sun and Swart (2008) to an 


‘approximate’ self-duality of the cadlag walks in continuous time and space that are considered 
here. 

In Section [2] we introduce the SAFVS and its dual before providing a heuristic explanation 
for our scaling. In Section [3] we provide a self-contained account of the necessary background 


on the Brownian web and net. Our main result is then stated formally in Theorem 3.5, which 
is proved in Sections [4}{6j Finally, Section [7] presents a brief numerical exploration of the effect 
of allowing the random walk paths of the dual process to cross on the positions at time one of 
the left-most and right-most paths emanating from a point. 
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2 The SAFVS and its dual 

In this section we introduce the set of branching and coalescing paths with which our main result 
is concerned. They arise as the dual to a special instance of the SAFVS. The reader familiar 
with the SAFVS can safely refer to Definition |2.2| (and the three lines preceding it) for notation, 
take note of Remark 2.3 and then skip to Section [3j 


2.1 The SAFVS 


The Spatial A-Fleming-Viot process (SAFV) without selection was introduced in Etheridge 


(2008); Barton et al. (2010). In fact the name does not refer to a single process, but rather to 


a framework for modelling the dynamics of frequencies of different genetic types found within a 
population that is evolving in a spatial continuum. It is distinguished from the classical models 
of population biology in that reproduction is based on ‘events’ rather than individuals. This 
introduces density dependence into reproduction in such a way that the clumping and extinction 
which plagues classical models is overcome, whilst the model remains analytically tractable. For 


a survey of the SAFV we refer to Barton et al. (2013). 


There are very many different ways in which to introduce selection into the SAFV. Here we 
adapt the approach typically adopted to introduce selection into the Moran model of classical 


population genetics. A full motivation of this approach can be found in Etheridge et al. (2014), 
to which we refer the reader. 

We suppose that the population is divided into two genetic types, which we denote a and A, 
and is evolving in a geographical space which is modelled by M. It will be convenient to index 
time by the whole of M. At each time t, the population will be represented by a random function 
{wt(x), i£R} defined, up to a Lebesgue null set of M, by 


wt(x) := proportion of type a at spatial position x at time t. 
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A construction of an appropriate state space for x t-x wt(x) can be found in Veber and Wakol- 


binger (2013). Using the identification 

[ {w(x)f{x, a) + (1 - w(x))f{x, A)}dx = 


iRx{a,A} 


f(x, n)A4(dx, dn), 


this state space is in one-to-one correspondence with the space of measures on Rx {a, A} with 
‘spatial marginal’ Lebesgue measure, which we endow with the topology of vague convergence. 
By a slight abuse of notation, we also denote the state space of the process (wt)te R by M\. 


Definition 2.1 (One-dimensional SAFV with selection (SAFVS)) Fix 1Z E (0,oo) and 
v E (0,1] and let p be a finite measure on (0,7^]. Further, let II be a Poisson point process on 
1 x 1 x (0, oo) with intensity measure 


dx <8> dt<S> p(dr). 


( 2 . 1 ) 


The one-dimensional spatial A-Fleming-Viot process with selection (SAFVS) driven by (2.1) is 
the M\-valued process (wt)tem with dynamics given as follows. 

If (x, t, r) E II, a reproduction event occurs at time t within the closed interval [x — r, x + r\. 
With probability 1 — s the event (x, f, r) is neutral, in which case: 


1. Choose a parental location z uniformly at random within (x — r, x + r), and a parental type, 
k, according to wt-(z), that is k = a with probability wt-(z) and k = A with probability 
1 - w t -(z). 


2. For every y E [x — r,x + r], set wt(y) = (1 — v)wt~(y ) + ul{ K=a j. 


With the complementary probability s, (x, t, r) corresponds to a selective event within [x — r, x+r] 
at time t, in which case: 


1. Choose two distinct ‘potential’ parental locations z,z' independently and uniformly at ran¬ 
dom within (x — r, x + r), and at each of these locations ‘potential ’ parental types k, k', 
according to wt-(z),wt~(z') respectively. 


2. For every y E [x — r, x + r] set wt(y) = (1 — v)wt-{y) + ul{ K=K / =a j. Declare the parental 
location to be z if n = k! = a or k = k' = A and to be z (resp. z!) if k = A, k' = a (resp. 
k = a, k' = A). 


In fact this is a very special case of the SAFVS introduced in 

Etheridge et al. (2014), and even 

more special than those constructed in 

Etheridge and Kurtz 

(2014 

), but it already provides a 


rich class of models. We use the assumption that p has bounded support in Section [4j but it is 
far from necessary for the construction of the process. The assumption that parental locations 
are sampled uniformly from (—r, r) has become standard in the literature, but at no point do we 
use it; our proofs work equally well for any symmetric distribution on (—r, r). The parameter 
v is often refered to as the ‘impact’ of the event. It can be loosely thought of as inversely 
proportional to the local population density. For our rigorous results we shall take v = 1, 
meaning that during a reproduction event, all individuals in the affected region are replaced. 


2.2 The dual process of branching and coalescing lineages 

Our primary concern in this paper is the dual process of the SAFVS, a system of branching 
and coalescing paths that encodes all the potential ancestors of individuals in a sample from the 
population. 
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When there is no selection, the dual contains only coalescing random walks, and each such 
walk corresponds to the ancestral lineage I of some individual; meaning that t traces out the 
locations in space-time occupied by the ancestors of that individual. 

If selection is present, then, at a selective event, we cannot determine the genetic types of 
the potential ancestors of the event (and hence the type and location of the actual ancestor) 
without looking further into the past. To avoid intractable non-Markovian dynamics, in this 
case we define a dual which traces all the locations in space-time which could have contained 
ancestors of a sample S from the contemporary population. This leads to a system of branching 
and coalescing random walks, tracing all the potential ancestral lineages. 

The dynamics of the dual are driven by the same Poisson point process of events, II, that 
drove the forwards in time process. The distribution of this Poisson point process is invariant 
under time reversal and so we shall abuse notation by reversing the direction of time when 
discussing the dual. 

We suppose that at time 0 (which we think of as ‘the present’) we sample k individuals 
from locations x\,... ,x^, and we write ^, for the locations of the N t potential ancestral 

lineages that make up our dual at time t before the present. 

Definition 2.2 (Branching and coalescing dual) FixIZ £ (0,oo). Let II be a Poisson point 
process on M x M x (0, oo) with intensity measure 

dx (g) dt® /a(dr) 

where n is a finite measure on (0, 1Z\. The branching and coalescing dual process (E t )t>o is the 
U n>1 W 1 -valued Markov process with dynamics defined as follows: At each event ( x,t,r ) £ II, 
with probability 1 — s, the event is neutral: 

1. for each i such that Q_ £ [x—r,x+r\, mark the i th lineage with probability v, independently 
over i and of the past; 

2. if at least one lineage is marked, all marked lineages disappear and are replaced by a single 
lineage (the ‘parent’ of the event), whose location at time t is drawn uniformly at random 
from within (x — r, x + r). 

With the complementary probability s, the event is selective: 

1. for each i such that Q_ £ [x—r,x+r], mark the i th lineage with probability v, independently 
over i and of the past; 

2. if at least one lineage is marked, all marked lineages disappear and are replaced by two 
lineages (the ‘potential parents’ of the event), whose (almost surely distinct) locations are 
drawn independently and uniformly from within (x — r, x + r). 

In both cases, if no lineage is marked, then nothing happens. 

A potential ancestral lineage is any path obtained by following the locations of potential ancestors 
of an individual in the sample; whenever the potential ancestor corresponding to the current 
position of the lineage is marked in an event, the path jumps to the location of the potential 
parent of the event (if the event is neutral) or to the location of (either) one of the potential 
parents (if the event is selective). Of course there are now many such paths corresponding to 
each individual in the sample. 

Remark 2.3 If we take the impact v = 1, then paths of the dual process cannot cross one 
another. We will impose this condition for our main result. 
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This dual process is the SAFVS analogue of the Ancestral Selection Graph (ASG), introduced 


in the companion papers Krone and Neuhauser (1997) and Neuhauser and Krone (1997), which 


describes all the potential ancestors of a sample from a population evolving according to the 
Wright-Fisher diffusion with selection. Duality of this type can be expressed in several different 
ways, but perhaps the simplest is the statement that the ASG is the moment dual of the 
diffusion. To establish the analogous duality for the SAFVS, we would need to be able to 
identify E[]~[" =1 wt(xi)] for any choice of points x \,..., x n G M. The difficulty is that the SAFVS 
wt(x) is only defined at Lebesgue almost every point x and so we have to be satisfied with a 
‘weak’ moment duality. 


Proposition 2.4 /Etheridge et al. (201J^] The spatial A-Fleming-Viot process with selection is 
dual to the process (Ht)t>o tn the sense that for every k G N and if G C(M fc ) n L 1 (M, k ), we have 


E. 


•w 0 


J J>(xi,...,x k ) {n wt (xj ) j> dx i... dx k 

= / ip(xi ,... ,x k )E {xU '" jXk} 


n^^t) 

L i=i 


dx i... dx k . 


( 2 . 2 ) 


In fact, a stronger form of this duality holds, in which the forwards in time process of allele 
frequencies and the process of potential ancestors of a sample are realised on the same probability 


selection and Etheridge and Kurtz (2014) for the general case. 


space through a lookdown construction; see Veber and Wakolbinger (2013) for the case without 


From now on 

forwards in time refers to forwards for the dual process, 
i.e. the reversal of that in Definition 12.II 


2.3 The scaling 

We shall keep the impact of each reproduction event (i.e. the parameter v) fixed, but we rescale 
the strength s of selection. In addition we perform a diffusive rescaling of time and space. For 
our main result we require v = 1, but the heuristic argument presented here and the numerical 
experiments of Section[7j suggest that there should be a non-trivial limit for any fixed v G (0, 1). 
Let us now describe the appropriate rescaling. The stages of our rescaling are indexed by n G N. 

Recall that p, is a finite measure on (0,7£]. For each n G N, define the measure /i n by 
pF{A) = fi(n l ^ 2 A), for all Borel subsets A of M. At the nth stage of the rescaling, our rescaled 
dual is driven by the Poisson point process IT' on RxRx [0,oo) with intensity 

n 1 / 2 dx <g> ndt (g> /a n (dr). (2.3) 


The y/n in front of dx arises since the rate at which centres of events fall in an interval of length 
l in the rescaled process is the rate at which they fall in an interval of length yfnl in the unsealed 
process. Each event of IP, independently, is neutral with probability 1 — s„ and selective with 
probability s n , where s n = a/y/n for some a G (0,oo). Thus, the n th rescaling of our dual 
process is precisely Definition 2.2 with (2.3) in place of (2.1). 

Although not obvious for the SAFVS itself, when considering the dual process it is not hard 
to understand why the scaling above should lead to a non-trivial limit. If we ignore the selective 
events, then each ancestral lineage follows a compound Poisson process and rescales to a (linear 
time change of) Brownian motion. Now, consider what happens at a selective event. The two 
new lineages are born at a separation of order 1 /y/n. If we are to ‘see’ both lineages in the limit 
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then they must move apart to a separation of order 1 (after which they might, possibly, coalesce 
back together). Ignoring possible interactions with other lineages, the probability that a pair of 
lineages achieves such a separation is of order 1 /y/n. Therefore, in order to obtain a non-trivial 
limit (which differs from that in the absence of selection) we need 0(y/n) such branches per 
scaled unit of time, so we take ns n = ay 'n or s n = a/y/n. (This argument can also be used to 
identify the correct scaling of s n in order to obtain a non-trivial limit in higher dimensions, see 


Etheridge et al. (2016).) 


Evidently we can extend the duality of Proposition 2.4 to lineages that are sampled at 
different times. For each point p = (x,t) £ M 2 , we think of an individual living at (x,t) and, at 
the nth stage of the rescaling, construct the set vX (p) of the potential ancestral lineages of the 
individual at p. (The reason for the uparrow in the notation will become clear in Section 4.1 ) 
Thus Pn{p) is a set of branching and coalescing paths. Our main result will concern the limit 
when we consider the union of such sets of paths as p ranges over a countable dense set of 
space-time points. 


3 The Brownian net 


In order to state a precise result, we must introduce the Brownian net and, in particular, the 
state space in which convergence takes place. A short introduction to the Brownian web and 


net is provided in this section. For a detailed survey of the surrounding literature, see Schertzer 


et al. (2015). 


Once again, the reader familiar with this area can note our modification of the usual state 


space (detailed in Section 3.1) and Remark 3.1 for terminology, and then skip to the statement 


of our main result, which can be found in Section 3.3 


3.1 The state space 

We now introduce the state space for our processes. Since our branching and coalescing paths 

a 

are only cadlag (not continuous), to capture the convergence of Vn{p) we will need a modification 


of the state space (introduced by Fontes et al. (2004)) that is commonly used for the Brownian 
web and net. 

For s £ [— 00 , 00 ], we set 

D[s\ = {f:[s , 00 ] — > [— 00 , 00 ] ; / is cadlag on [s, 00 ] n (— 00 , 00 )}. 

For / £ Z)[s], it will be convenient to define cr(f) = <Jf = s to be the first time at which / is 
defined. We set 

M= |J D[t]. (3.1) 

te[— 00 , 00 ] 

For each s £ [— 00 , 00 ] and / £ Z)[s] we define a function / as follows. Let Kt = tanh _1 (t) and 
note that k is an order preserving homeomorphism between [—1,1] and [— 00 , 00 ]. (The specific 
choice of the function tanh is a convention in the literature. We use the symbol k in place of 
tanh to denote a change of time rather than rescaling of space.) Then if / £ M we define 


m = 


tanh 

1 + \^t\ 


(3.2) 


for t £ [ft 1 (crf), 1]. It follows immediately that / is cadlag. 
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Figure 1: Self duality of systems of coalescing random walks on the diamond lattice 
that converge to the Brownian web. The blue arrows represent the forwards in time 
coalescing random walks, while the red arrows represent the backwards in time dual. 


In Section 5.1 we define a generalization p of the Skorohod metric that acts on cadlag paths 


with possibly different starting times. We show (in Section 5.2) that 


dM(fi, fv) = p(fi, / 2 ) V | tanh(cr/ 1 ) - tanh(a/ 2 )| 


(3.3) 


is a pseudo-metric on M. In standard fashion, from now on we implicitly work with equivalence 
classes of M and, with mild abuse of notation, treat (M, cZm) as a metric space. In view 


of (3.2), the intuition for (3.3) is that convergence in ( M,cIm ) can be described as local Skorohod 
convergence of the paths plus convergence of the starting times. 


If we restrict to continuous pat 
the space (M, d~) introduced by 


is and replace p wit 


Fontes et al. 


(2004 


i the usual L°° distance, then we recover 
see (5.21). Convergence in the corre¬ 


sponding metric on continuous paths can be describee 
plus convergence of starting times. 


as locally uniform convergence of paths 
We define the set K.(M) of compact subsets of M, equipped with the Hausdorff metric, m, 


and including the empty set 0 as an isolated point. We show (in Section 5.2) that (M,cIm) is 
complete and separable; the space K(M) inherits these properties. Similarly, we write /C(M) 
for the space of all compact subsets of M. 


3.2 The Brownian web and net 


Arratia (1979) was the first to observe that the Brownian web exhibits a self-duality. It is most 
easily understood by first considering the prelimiting system of coalescing simple random walks, 
one started from each point of the diagonal space-time lattice. As illustrated in Figure [lj one 
can think of each path in the prelimiting system as the concatenation of a series of arrows, 
representing the jump made by the path out of each point of Z at each time t G Z, and there is 
then a natural dual system of arrows (on the dual lattice), pointing in the opposite direction of 
time, which ‘fills out the gaps’ between the walkers forwards in time. It is not hard to convince 
oneself that the law of the resultant system of backwards paths is equal to that of the forwards 
system, rotated by 180 degrees about the origin (0,0). Under diffusive rescaling, the forwards 
and backwards systems converge jointly to a pair (W, W), known as the double Brownian web, 
in which VV is the Brownian web and the dual web W has the same law as W rotated by 180 
degrees. 


Sun and Swart (2008) showed how to obtain an analogue of the Brownian web, which they 


dubbed the Brownian net , as the scaling limit of the paths traced out by a system of branching 
and coalescing simple random walks. If there is a branch at (x,t), then the random walker at 
















point x at time t has two offspring which it places at (x — 1) and (x + 1), so that the space-time 
point (x, t ) is connected by paths to each of (x — 1, t + 1) and (x + 1, t + 1). In order to obtain 
a non-trivial limit, the branching probability of each path in each time step is scaled to be 
Q(l/y/n), corresponding exactly to the scaling in the dual to the SAFVS; that is at the nth 
stage of the rescaling the probability that two paths, one stepping left and one stepping right, 
emanate from a given point is (/y/n. 

In contrast to the Brownian web, the Brownian net will have a multitude of paths coming 
out of each space-time point. The key to its characterisation is that it has a well-defined left¬ 
most and right-most path, which we denote l z and r z respectively, emanating from each point 
z = (i,t)el 2 and these determine what is called a left-right Brownian web. Essentially, the set 
of left-most (resp. right-most) paths form a Brownian web with a leftwards (resp. rightwards) 
drift. Thus, for any deterministic pair of /e-tuples of points (z i, ■ ■ ■, Zk), (z[, ..., z' k ,), the left¬ 
most paths l Zl ,... ,l Zk are distributed as coalescing Brownian motions with drift ( to the left, 
and the right-most paths r z /,... ry ; are distributed as coalescing Brownian motions with drift 
( to the right. 

Before we can fully describe the Brownian net, we must explain how a left-most path l z = 
lr XjS ) and a right-most path ry = ry/y) interact. Their joint evolution after time s V s' is the 
unique weak solution to the left-right stochastic differential equation 

dL t = £1 {L t ^R t }dB l t + £1 { Lt =R t }dB$ - (dt , 
dR t = £1 {Lt^Rt}dBl + Cl { Lt =R t }dBt + (dt, 


where B\, B[ and Bf are independent standard Brownian motions and if s < t then L s < R s 

Lt < R-t- 

system. 


Sun and Swart (2008) proved (weak) existence and uniqueness of the solution to this 


A straightforward extension of (3.4) is sufficient to specify the joint distribution of any finite 
collection of left-right paths, which are known as left-right coalescing Brownian motions. 


Remark 3.1 In Sun and Swart (2008) the drift parameter ( of the left-right stochastic differ¬ 


ential equation used to construct the Brownian net is allowed to vary but the diffusion constant, 
( 2 , of the Brownian motions is always taken to be one. Applying a linear time change to their 
construction yields general ( 2 and we will use such webs and nets (and results from elsewhere 
extended trivially to apply to them) without further comment. We shall refer to the Brownian 


net corresponding to the left-right system (3.4) as the net with drift ( and diffusion constant ( 1 


It remains to give a rigorous characterization of the Brownian net. One last ingredient is 
required. 


Definition 3.2 Let a : [<r Q ,oo) —> M and a! : [<7y,oo) —> M be paths. We say a crosses a! from 
left to right at time t 6 M if there exists t~ < t and t + > t such that a(t ~) — a'(t ~) < 0 and 
a{t + ) — a’(t + ) > 0 and t = infjs G (t~,t + ); (a(t~) — a'(t~))(a(s) — a'(s )) < 0}. 

We define a crossing of a’ by a from right to left analogously. We say that a crosses a' if it 
does so from either left to right or right to left. 

Given two paths a and a' which cross at say, time t, we can define a new path g by following 
a up until time t, and subsequently following a!. This procedure is known as hopping from a to 
a' at the (crossing) time t. Given a set P of paths, T-i C ross{P) is defined to be the set of paths 
obtained by hopping a finite number of times between paths within P. 


Definition 3.3 (Sun and Swart (2008)) The Brownian net A f is the JC(M) valued random 
variable whose distribution is uniquely determined by the following properties: 
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Figure 2: A wedge. In the notation of (3.5), the path r is in Xf, and the path l is in Xf. 
The last time at which both paths are defined is s, in this case given by cr(Z); r(s) < l(s) and, 
tracing backwards in time, T is the first time at which r = l. The wedge is the shaded region. 


1. For each deterministic z£l 2 , almost surely Af contains a unique left-most path l z and a 
unique right-most path r z . 

2. For any finite deterministic set of points Z\,..., Zk, z[,..., z' k , G M 2 , the collection of paths 
1 Z1 ,..., l Zk , r z ^,..., r z > has the distribution of a family of left-right coalescing Brownian 
motions. 

3. For any deterministic dense countable sets T> l ,T> r C M 2 , 

Af = T-L C ross{{lz ; 2 e F> 1 } u{r z ;ze V r }). 


The proof of our main result rests on verifying the conditions of Theorem 3.4, which provides 
criteria under which a sequence of processes converges to the Brownian net. It is obtained by 
combining Theorem 6.11 and Remark 6.12 of Schertzer et al. (2015). To state it, we require the 
notion of a wedge. Let ( Xf) and (Xf) be two random sets of paths such that their rotations 
by 180 degrees about (0,0) are /C(M) valued random variables. Take l G Xf and r G Xf, 
defined on time intervals (—oo ,cr(l)] and (—oo,<r(f)] respectively. We write s = a(l) A a{r) 
for the largest time at which both paths are defined. Suppose that r(s) < l(s ) and define 
T := sup{f < s : r(t) = l(t)} to be the first time time the paths meet (as we trace backwards in 
time). We call the open set 


W(r,l) :={(i,ii)eR 2 :T < u < s,f(u) < x < l(u)} 


(3.5) 


a wedge. This set is illustrated in Figure [2] We say that a path it started at time a n enters W 
from the outside if there exists a n < u < t such that (tt(u), u) ^ W and (n(t),t) G W. Here, W 
denotes the closure of W. 


Theorem 3.4 (Schertzer et al. (2015)) Let (Xf) and (Xf) be two sequences oflC(M) valued 
random variables. Let (X^) and (X[) be two random sets of paths such that their rotations by 
180 degrees about (0,0) are /C(M) valued random variables. Set X n = 'H C ross(X l n U Xf) and 
X n = H cross (X l n UXf). 

Suppose that: 


(&/) Paths in Xf (resp. Xf) do not cross. No path in X n crosses a path of Xf from right to 
left, and no path in X n crosses a path of Xf from left to right. No path in X l n crosses a 
path of Xf, and no path of Xf crosses a path of Xf. 
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(38) For any k E N, and any (z ±,..., z^k) C M x M there exists a convergent sequence 

(l'n,\ i • • ■ i In,hi r n , 1 j • • • m,k ) i 

where l Uj i E X l n ,r n ^ E Xf, whose limit (in distribution, in M 2k , as n —>• oo) is a collection 
of left/right coalescing Brownian motions started at (z\,, Z 2 k)- 

fitf?) Whenever k E N and l n E X l n and r n E Xf are such that (i n ,r n ) converges (in M 2 , in 
distribution, as n —> oo) to left/right Brownian motions ( l,f), the first meeting time of l n 
with r n also converges in distribution to the first meeting time of l with r. 

(@) Paths of X n do not enter wedges of X n from the outside. 

Then, X n converges (in /C (M), in distribution) to the Brownian net. 

3.3 Statement of the main result 


We are finally in a position to give a formal statement of our result. Recall from Section 2.3 
that Pn (p) is the set of potential ancestral lineages of the individual at p E M 2 at the nth stage 
of our rescaling. 

Let (LLi)neN be an increasing sequence of countable subsets of M 2 such that, for each n, V n 
is locally finite, and as n —> oo the set T> n becomes everywhere dense. 

We define s/{V n ) = U p ex>„ ^nG 9 )- The set sF(T > n ) contains the potential ancestral lineages 
of all p E V n . However, s8 ( V n ) is not an element of /C(M), since it is not a closed subset of M, 
and so at the very least we should consider its closure. This requires that we augment (D n ) 
to also include ancestral lineages / that extend backwards in time until time — oo, and we define 
/(—oo) = 0 for such /. We include oo in the domain of each path / by defining /(oo) = 0. 
Additionally, define the boundary paths 


B = {/(•) = -oo; Of e [- 00 , 00 ]}u{/(•) = 00 ; 07 e [- 00 , 00 ]}. 


(3.6) 


We then set V\{V n ) = &f(V n ) U B. Lemma 6.4 shows that V\((D n ) is an element of /C(M). 


Recall from Definition |2.2| that v is the probability that an ancestral lineage that lies in 
[x — r, x + r] at time t— is affected by the event (x, t, r) and that s n = a/y/n is the probability 
(at the nth stage of our rescaling) that an event is selective. Our main result is the following. 


Theorem 3.5 Let v = 1. As 


n 


00 , 'Pn(TVt) converges weakly to M in 1C (M) where, in the 


terminology of Remark 3.1, AT denotes the Brownian net with drift 


and diffusion constant 


2 f n 2 

C = 3 a / o r M dr )> 


4 

f = g J o r 3 p(dr). 


(3.7) 


(3.8) 


It rests heavily on the theory of 
the Brownian web and net, in particular on Theorem |3.4| We will now place this result in the 


The proof of Theorem |3.5| can be found in Sections 


context of existing work and outline some of the additional difficulties that are encountered in 
our setting. 

Consider, first, what would happen in the absence of selection. Our dual process reduces to 


a system of coalescing random walks and, as proved in Berestycki et al. (2013), after a diffusive 
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Figure 3: Complications. The diagram on the left illustrates the way in which paths can both 
coalesce and branch through the same event. The second diagram presents a case of multiple 
collisions in a ‘neutral’ event. 


rescaling one recovers a system of (instantaneously) coalescing Brownian motions. If we set 
v = 1 and take the centre of the event, rather than a randomly chosen point, as the location of 
the parent, then this corresponds to the process of ‘trajectoires d’exploration’ of Micaux (2007), 
who constructs a stochastic flow of maps by considering the dual started from every space-time 
point in the plane. If we specialise still further so that all events have radius 1 and we start 
‘exploration paths’ only from the centre of each reproduction event then we recover a cadlag 


version of the Poisson trees of 

Ferrari et al. 

2004 

). By interpolation we recover the Poisson 

trees themselves. In 

Ferrari et al. 

(2005) (see 

Ferrari et al. 

(2003) for a more detailed account) 


it is shown that under a diffusive rescaling the Poisson trees converge to the Brownian web. 

Even with the simplification v = 1, our prelimiting process is considerably more complex 
than that considered in Sun and Swart (2008). When lineages are covered by the same neutral 
reproduction event, they coalesce. In particular, more than two lineages can coalesce in a single 
event. At selective events, when we must trace two potential parents, we can see either just 
branching or, if more than one lineage lies in the region affected by the event, a combination of 
branching and coalescence (see Figure [3]). 

Further complications compared to systems of branching and coalescing simple random walks 
arise since (a) our ancestral lineages jump at random times and the displacement caused by such 
jumps is random; and (b) the motion of distinct ancestral lineages becomes dependent when 
they are within distance 27 Z of each other. 

In spite of the additional complexity, it still makes sense to talk about left-most and right¬ 
most paths and this will be the key to our analysis. In fact (b) can be handled through elementary 
arguments; it turns out that the time periods during which ancestral lineages are ‘nearby but 
not coalesced’ are too brief to affect the limit. 

In order to overcome (a), we must identify a dual system of (backwards in time) branching 
and coalescing lineages. At first sight, it is far from obvious that such a dual exists; in contrast 
to previous work, our pre-limiting systems will not be self-dual. We will construct a dual system 
with the property that, in contrast to Figure [lj after rotation by 180 degrees, although, sepa¬ 
rately, left-most and right-most paths in the dual have the same distribution as their forwards 
counterparts, the joint distributions of the forwards and backwards systems differ. The dual, 

is illustrated in Figure [4j 


which is defined in Section 4.1.1 


4 Convergence of left/right paths 

We now turn to the proof of our main result. Recall that we take v = 1 so that if a lineage is 
in the interval covered by an event then it is necessarily affected by it. 
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4.1 Paths and arrows 


In order to discuss the self-dual systems of branching and coalescing lineages that converge to 
the Brownian net, we must be precise about what we mean by ‘branching-coalescing paths’ 
and, in particular, have a notation for keeping track of the direction of time. We shall follow 
Fontes et al. (2004) in using segments of paths called arrows. Loosely speaking, paths are 
formed by concatenating arrows. A path (or an arrow) is an R-valued function whose domain 
is a subinterval of R. If a path/arrow is forwards (resp. backwards ), then ‘moving along it’ 
means moving along the image of the path forwards (resp. backwards) with respect to the usual 
(resp. reversed) order on the time domain. We shall use t to denote forwards and f to denote 
backwards paths. 

For a < b < c, the concatenation of forwards paths / : [a, b) —> R and g : [b, c) —> M refers to 
the function h : [a, c) —>• M which is equal to / on [a, b) and equal to g on [b, c). Concatenation 
of backwards paths is defined analogously. 

When we are following a backwards path or arrow we interchange left and right, in the same 
way as left and right interchange if we reverse the direction in which we walk. For clarity, we 
reserve the terms north, south, east and west for global directions associated to the plane M 2 
and use the terms right and left for local directions whose frame of reference depends on the 
direction in which we are travelling. 


4.1.1 Forwards and backwards paths 


Recall from Section 2.3 that IF 1 denotes the Poisson point process that drives the system of 
branching and coalescing paths at the nth stage of our rescaling. We refer to each ( x , t, r) G II" 
as an event affecting the set {f} x [x—r, x+?’] or, equivalently, affecting each point y G [x—r, x+r] 
at time t. The east- and west-most points of this event are (x + r, t ) and (x — r, t ) respectively. 
To each (y, s ) G (—oo, oo) x R we associate a unique forwards arrow (pointing due north) and a 
unique backwards arrow (pointing due south), defined as follows. Let 

T^ s = inf{t; 3(x, t, r) G II n ,y G [x — r,x + r],t> s}, 

T^ s = sup{t ; 3(x, t, r) G II" , y G [x — r, x + r], t < s}, 

be the times of the first event (non-strictly) north of (y, s ) and the first event (non-strictly) 
south of ( y,s ), respectively, that affects the point y. Let * G {f,i}- An arrow starting at 
(y,s) is simply a path a* : [s, T* ) —> R defined to be the constant function a* s (u) = y. We 


shall call the event (x, t, r) 


lim 


T* 


,(«(«),«) = (y^ T y,s 


that defines T* s the finishing event of a y , s . 


g n 

i G [x — r, x + r\ X {f}. 


y,s\ 

It must be that 


For each * G {f,i}, we can now associate two important sets of paths to each point (y, s). 
Let us first consider the forwards paths. The set Vn{y,s) is best described in words; it is the 
set of paths that are obtained by following the arrow ocy tS out of (y, s ) and then, every time we 
finish an arrow, following a new arrow that starts from (one of) the (potential) parent(s) of the 
finishing event of a yjS - In other words, the forwards paths from (y, s) correspond precisely to 
the set of potential ancestral lineages of an individual who lived at the point y at time s, that 
we described in Section 2.3 We include time oo into the domain of each such forwards path, 


and set the location at time oo to be 0 . 

The set Pn(y, s) of backwards paths is also best described in words. It is the set of paths 
obtained by first following the arrow a\ s out of (y, s ) and then, every time we finish an arrow 
4 . 


a 


y ,s' 


1. If the finishing event of ay jS ' is neutral with, parent at v, then 
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Figure 4: The movement of forwards and backwards paths (illustrated as interpolated arrows) about a 
selective event (left) and a neutral event (right). The events are shown as finely dotted horizontal lines 
and the (potential) parent(s) as small circles. Forwards paths travel northwards and backwards paths 
travel southwards, according to the compass shown between the two diagrams. 


(a) if y' < v, follow the arrow out of the west-most point of the finishing event of ctyy, 

(b) if y’ > v, follow the arrow out of the east-most point of the finishing event of a y r s i ■ 

2. If the finishing event of a y \ s ' is selective with potential parents at v < v' then 

(a) if y' < v, follow the arrow out of the west-most point of the finishing event of cty iS /, 

(b) if y’ > v 1 , follow the arrow out of the east-most point of the finishing event of oyy, 

(c) if y' 6 [u, v'], a path can follow either one of the arrows out of the east-most/west-most 
points of the finishing event of ayy. 

In analogy to forwards paths, we include time — oo into the domain of each such backwards path, 
and set the location at time —oo to be 0. See Figure [4] for an illustration of the forwards and 
backwards paths. 

In keeping with our previous notation, for each forwards/backwards path /, we write <?(/) = 
cTf , for the time at which it starts. 


4.1.2 Interpolated paths and arrows 

We wish to exploit the existing theory of Brownian webs and nets, which was developed in 
a setting restricted to continuous paths, and so we shall approximate the systems of (cadlag) 
forwards and backwards paths of the last subsection by corresponding systems in which the 
jumps have been interpolated. Ferrari et al. (2005) achieve this by simply taking paths that 
interpolate between the starting points of arrows. However, in our situation this would result 
in arrows which cross each other and, worse, pass through reproduction events that did not 
previously affect them. Instead, we adopt a ‘just in time’ approach to our interpolation: we find 
small non-overlapping intervals of time and space about each event in which to interpolate. 


Lemma 4.1 Let n 6 N. For each p = (x , t, r) 6 U n and each e > 0 define the set 
& e (x,t,r) = {( y,s ) el 2 ; \x - y\ < r, \t - s| < e}. 

Almost surely, there exists a map T : n n —> (0, oo) such that the sets (^x(p)(p))pen n are distinct. 
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PROOF: This follows essentially immediately since IP has finite intensity; consequently the set 
of time coordinates of points of II n , restricted to any strip [-K, JfjxRx [—7£, TZ], where K E M, 
has (almost surely) no limit point. ■ 

Let p E V^(y,s) and let cty :S '■ [s,Ty tS ) E R be one of the forwards arrows that make up 
p. Let p = (x,t,r) denote the finishing event of oty }S (so that, in particular, T* s = t a.s.). 
Suppose that a' is the next arrow in p and write z = a'(TyP for its starting point. We say 
that a : [s, t) —> M is the interpolated arrow of if both 

1 . 5y, s (u) = c4, s (u) for all u < T$ >s - T (p), and 

2 . 5y, s (u) is linear on [T^ s - T(p),t) and lim u ^ t a\ tS (u) = z. 

Note that the interpolation of an arrow depends on the path / in which it is contained. 

Given a forwards or backwards path / E Vn{y, s), we define the continuous path / to be the 
concatenation of the interpolations of the arrows within /, and additionally setting /(oo) = 0 
for forwards paths and /(—oo) = 0 for backwards paths. We define 

vUy, s ) = {/;/£ ^n(y> s )} 

and define the set of interpolated backwards paths Vn(y,s) in analogously. Of course, interpo¬ 
lated paths are close to their equivalent non-interpolated paths. 

Lemma 4.2 Let (y,s) E M 2 and let f E Vn(y,s). Then sup tg ( 0 . / 00 j | f(t) — f(t)\ < 21Zn ~ 1 / 2 . 
The analogous estimate holds for backwards paths. 


Proof: 


Note that af = cry. By definition, in the notation 


of Lemma 


4.1 


f(u) = f(u ) unless u is 


such that ( f(u),u ) E &r(x,t,r)i x i L r ) f° r some ( x,t,r ) E IP. When (/(«), u) E &r( x , t ,r){ x , t, r) 
we have | f(u) — f(u)\ < 2r. Since, by definition of IP we have r < IZn “ 1//2 , this completes the 
proof. ■ 


4.1.3 Left-most and right-most paths 

We now associate to each (y, s) four special paths. 

Definition 4.3 Left-most and right-most forward and backward paths are defined as follows. 

1. The left-most forward path from ( y,s ) is the element ofVn(y,s) obtained by choosing the 
(forwards) arrow with the west-most potential parent, whenever a choice is available. 

2. The right-most forward path from (y, s ) is the element ofVn(y , s) obtained by choosing the 
(forwards) arrow with the east-most potential parent, whenever a choice is available. 

3. The left-most backward path from ( y,s ) is the element ofVn(y,s) obtained by choosing 
the (backwards) arrow from the east-most point of the finishing event whenever a choice is 
available. 

4- The right-most backward path from ( y,s ) is the element ofVn(y,s ) obtained by choosing 
the (backwards) arrow from the west-most point of the finishing event, whenever a choice 
is available. 


15 



We will sometimes shorten ‘left-most’ and ‘right-most’ to l -most and r-most. 
For D C M 2 , f e {t, 1} and * G {l, r} we define 


Qff(D) = {/;/ = ,fy S is the f -most path of some (y, s ) G D}. 


Recall from Section 3.3 that, in order to exploit the compactness properties of our state space, we 


must also include some extra paths, corresponding to ancestral lineages that extend backwards 
in time until —oo. First, we say that a path / : M —» M is an infinite extender of Q^'(D ) if there 
exists a sequence — Qn\D) and a sequence (t m ) such that t m f —oo and f(t) = f m (t) 

for all m and t > t m . We make the corresponding definition for Qn (D) and, for * G {t,i} and 
f G {l,r} we define to be the se t Q f infinite extenders of Qfi\D). Recall also the 

boundary paths B defined in (|3.6|). Then, define 


V* ,ji (D) = Qff(D) U U B 


- r>*>t 


*.t ,inf( 


(4.1) 


and Vn\D) = {/; / G to be the corresponding sets of interpolated paths. 


We now verify Condition (&/) of Theorem 3.4 Recall, from Definition 3.2 what it means for 
two paths to cross. 


Lemma 4.4 Let D be any subset of M 2 and let n G N. Let f G {Z, r*} and * G Then, 

almost surely: 

1. For all jT g Vn\D) and G Vn\D), the paths f 1 ' and do not cross. 

2. For all f*,g* G the paths f* and g* do not cross. 

Further, the same results hold for interpolated paths ft G Vn(D) and ft G Vn\D). 


Proof: In the first case, note that two forwards paths can only cross if they are first coalesced 
and are then subsequently affected by the same selective reproduction event. In the second case, 
note that a forward path can only cross a backwards path if both are affected by a common 
event. In both cases, the fact that crossing cannot occur is then an easy consequence of the 
definitions (or see Figure [1]). ■ 


Remark 4.5 A forwards left-most path can cross a backwards right-most path, and a forwards 
right-most path can cross a backwards left-most path. Similarly, a forwards left-most path can 
cross a forwards right-most path (if they are both affected by the same selective event), and a 
backwards right-most path can cross a backwards left-most path. 

Although not immediately obvious from the definition, the next lemma is a helpful feature 
of our construction. 

Lemma 4.6 A forwards left- (resp. right-) most path has the same distribution as a backwards 
left- (resp. right-) most path which has been rotated by 180 degrees. 

PROOF: The proof is based on the movements of paths affected by reproduction events, which 
is depicted in Figure |4j It suffices to consider the case of left-most paths; the case of right-most 
paths then follows by symmetry. 

First observe that the rate at which an event falls on (an arrow in) a path has the same 
distribution whether we look forwards or backwards in time and, when an event falls on a 
(forwards or backwards) path, the spatial position of the path will be uniformly distributed over 
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the region affected by the event. Let us denote that position by V. Thus if the event corresponds 
to p = (x,t,r), then V is uniformly distributed on \—r,r\. 

Consider a left-most forwards path affected by a neutral event. The path jumps to the 
position of the parent, which we denote by U. Thus, on the event V < U our path jumps a 
distance U — V to the left, and on the event U > V it jumps a distance V — U to the right. 

Now consider the left-most backwards path. Retaining the notation above, at a neutral 
event, on the event V < U the path jumps to the west-most endpoint, which, once rotated by 
180 degrees becomes a jump to the right of size V — (— r). On the other hand, on the event 
V > U, the path jumps to the east-most endpoint, which upon rotation becomes a leftwards 
jump of magnitude r — V. 

Conditional on V < U, V is uniform on (—r, U), so U — V = V— (—r). Similarly, conditional 
on V > U, V is uniform on (U. r ) and V — U = r — V. Therefore, if we restrict to only neutral 
events, forwards left-most paths and backwards left-most paths rotated by 180 degrees have the 
same distribution. 

Next, consider a selective event. We use a similar argument. The two potential parents are 
sampled uniformly from the event. We denote their positions by U\ < U 2 ■ Combined with V, 
we now have three independent uniformly distributed random variables on [—r, r\. Let us write 
them in ascending order as U^\ U^ 2 \ . The following events may occur: 

(a) V' = U^\ in which case U\ = U^ 2 \ so the path makes a rightwards jump of magnitude 
Ui-V-, 

(b) V 7 / U' 1 ', in which case U\ = EA 1 ), so the path makes a leftwards jump of magnitude V — U\. 

Note that we are not concerned by the value of C/ 2 , since we are interested in a left-most path. 
For a left-most backwards path, again at a selective event, the following events may occur: 

(a) V = E/«, in which case the path jumps to the west end-point of the event, a jump which 
after rotation by 180 degrees becomes a rightwards jump of magnitude V — (—r); 

(b) V + C/ (1) , in which case the path jumps to the east end-point of the event, a jump which 
after rotation by 180 degrees becomes a leftwards jump of magnitude r — V. 

Again, because we consider a left-most path we are not concerned by the value of U- 2 - 

We now compare the jumps in the (a) cases. Conditional on f < JJ\, V is uniformly 

distributed on (—r,Ui) and thus (as in the neutral case) V — U\ = V — (—r). Similarly, for 
the (b) cases, conditional on U\ < V, V is uniformly distributed on {U \, r) and thus (also as in 

the neutral case) V — U\ = r — V. Thus the left-most forwards path and the rotated left-most 
backwards paths have the same distribution, which completes the proof. ■ 


Remark 4.7 1. Note that Lemma 4-6, with the same proof, remains true when the parent 

locations are sampled according to any symmetric distribution on (—r, r). 


2. In previous work on the Brownian web and net, there is a strict self-duality in the prelimit¬ 
ing systems. Here, we see a new feature. Although separately the left and right-most paths 
have the same distributions forwards and backwards in time, their joint distribution differs. 
As can be seen in Figure [7J our backwards paths branch less frequently than forwards ones, 
but when they do branch, they make larger jumps. 

Recall the state space fC(M) defined in Section [3T| The space /C(M) is an appropriate space 
in which to consider convergence of sets of (branching/coalescing) forwards paths, but it is not 
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suitable for backwards paths. To remedy this, if P is a set of backwards paths then we define 
—P = {/;/£ P}, where / : [—cry, oo] — > [— 00 , 00 ] given by f(t ) = — /(— t) is the rotation of / 
by 180 degrees. Thus, — P E M is a set of forwards paths. With a slight abuse of notation, if f n 
is a sequence of backwards paths and / is a backwards path, we will say /„—>•/ in M if f n —> f 
in M. Similarly, if P n is a sequence of sets of backwards paths and P is a set of backwards paths 
we write P n —> P in /C(M) to mean that — P n —> —P in /C(Af). We apply the same terminology 
to interpolated paths. 


4.2 Convergence of a pair of left/right paths 


We must ultimately verify that any limit point of our combined systems of left and right-most 

As a first step, in this subsection we take 


of Theorem 3.4 


paths will satisfy condition 
the limit of a pair of paths, comprising one left-most path and one right-most path started at 
some time s (which, since IT" is homogeneous in both space and time, we may, without loss of 


generality, take to be zero) and show that it satisfies the system (3.4). Our approach mirrors 


that in Sun and Swart (2008), and as far as possible we shall adhere to their notation. With 


this in mind, let L n and R n denote respectively the left-most and right-most forwards paths 
associated to the points (y n ’ 1 , 0) and (y n,r , 0). We assume that the sequences of starting points 
converge to (y l , 0 ) and (y r , 0) respectively. 


Remark 4.8 A straightforward modification (in order to take into account the selective events 
and the resulting drift of the left and right-most paths) of Lemma f.l in Berestycki et al. (2013) 
shows that the pair ( L n ,R n ) stopped when it first enters the ‘coalesced’ state converges in distri¬ 
bution to a pair of independent Brownian motions with drift ±£ ; stopped when they first meet. 
In particular, the first meeting times also (jointly) converge. 


Berestycki et al. (2013 


Following Sun and Swart (2008), using their Lemma 2.2, when Lq < Rq there is a one-to-one 
correspondence between weak solutions of (3.4) and solutions of the system 


dL s = fdB l Sa +fdB c Cs -Qds, 
dR s = fdB r Sa + fdBQ s + fds, 
s — S s T C s , 

0 = f 1 {L s < Rs\dC s , 

Jo 


(4.2) 

(4.3) 

(4.4) 

(4.5) 


where B l ,B r and B c are independent standard one dimensional Brownian motions. The in¬ 
finitesimal variance £ 2 of the Brownian motion and the drift f depend on a and y and are 
given by (3.8) and (3.7) respectively. The solution ( L S ,R S ) to this system is a C K 2[0,oo) valued 
process. 

In the case Ro < To, according to (3.4) both R s and L s evolve as independent Brownian 
motions, with drift ±£, until they meet. Thus, in view of Remark 4.8, it suffices to treat the 
case of Lq < Rq, where Lq = y l and i?o = y r ■ 

The essence of (4.2)-(4.5) is that, once L s and R s meet, they will accumulate non-trivial 


time together as a result of a sticky interaction (see Proposition 2.1 in 

Sun and Swart 

(2008) 

for details). As part of the proof of their Lemma 2.2, 

Sun and Swart 

(2008 

) show that S s = 


fo 1 {L u < R u }du and C s = £ 1 {L u = R u }d 


u. 


Proposition 4.9 LetT E (0,oo). Asn—> 00 , (Lf , Rs) se [o,T] converges weakly to (L s , R s ) s e[o,T]) 
in the sense of ZD ffi 2[0,T] valued processes. 
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The analogous result for interpolated paths, which we denote by ( L n ,R n ), follows easily: 

Corollary 4.10 LetT E (0,oo). As n —> oo, (Lf , Rf ) sS [o,t] converges weakly to (L s , R s )se[o,T]) 
in the sense of C R 2 [0, T] valued processes. 


Proof: By Lemma 4.2, the weak convergence of Proposition 4.9 also holds (in D R 2[0, T]) 
when ( L n , R n ) is replaced by (L n , R n ). Since the space of continuous paths with the supremum 
topology is continuously embedded in the space of cadlag paths with the Skorohod topology, it 
follows that the same convergence holds in (7^2 [0, T}. ■ 

The remainder of this subsection is devoted to the proof of Proposition |4.9| We begin by 
breaking down the evolution of the pair (Lf, R (?) into several different pieces. At time s > 0, we 
say Lf and Rf are 


For s > 0 we set 


n 


coalesced if Lf = Rf, 

nearby if L n s / R n s and \L n s - R n s \ < 

separated if \Lf — iPj > ^ . 


Cf = I are coalescedjdu, 

Jo 

N " = [ are nearbyjdu, 


2 n 
1/2 ’ 


(4.6) 


Sf = I 1 are separated} du, 

Jo 

and we note that C™ + iV" + S'f = s. 

We dehne sequences of stopping times to track the changes of state of ( L n , R n ) during [0, T], 
Firstly, 

t[ 1,C = inf{s > 0 ; (Lf, Rf) are coalesced}; 

T k’ C = i n f{ s ^ T k- 1 ! (T”, R.f ) are coalesced, (Lf_,Rf_) are not coalesced}. 


Similarly, we define sequences tJ!’ N and t(!' for ‘re-entrance’ times of (Lf,Rf) to the states of 
‘nearby’ and ‘separated’ respectively. It is easily seen that each rf' C , rJ?’ N , tJ?’ S is a stopping 
time and ( L n ,R n ) is strong Markov. 

Each jump of ( L n ,R n ) is caused by one or both lineages being affected by a single event of 
IP. If ( L n ,R n ) is coalesced immediately before this event then the event affects both L n and 
R n , whereas if they are separated the event affects only one of the two. The motion is more 
complicated when ( L n ,R n ) is in the nearby state, when events can affect one or both lineages, 
but we shall see that the time spent in that state is negligible as we pass to the limit. 

In order to identify the limiting objects, it is convenient to isolate the parts of the motion 


that contribute to the drift from those that contribute to the martingale terms in (4.2) and (4.3). 


The decomposition we make is not unique. Our particular choice highlights the fact that the 
martingale part of the motion of lineages is driven by neutral events, while the drift can be 
attributed to selection. 

First we are going to define three random walks, from which we can build ( L n , R n ) when we 
are in the coalesced or separated states. To understand the origin of these, first suppose that a 
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lineage is hit by a neutral event. When this happens, the position, y, of the lineage is uniformly 
distributed on the region affected by the event and it will jump to the position z of the parent, 
which is also uniformly distributed on the region. Neutral events fall according to a Poisson 
Point Process with intensity 

n l t 2 dx (g> ?7.(1 — s n )dt <g> y, n (dr), 


so they hit y at rate 

/ oo roc pr 

/ / 1 {y £{x — r,x + r]} dx n n (dr)n}l 2 dx 

-oo J 0 J —r 
poo 

= 2n(l — s n ) / ry{dr). (4.7) 

Jo 

We define V n to be a symmetric random walk driven by a Poisson Point Process with intensity 

n(l — s n )dt (g> 2 r/j,(dr). 

At an event (t,r), the walk jumps with displacement Ji/y/n where 

F[J 1 eA\=F [Z r -U r e A\, (4.8) 

and U r and Z r are independent uniform random variables on [0,2?’]. 

Now consider the motion due to selective events. If the pair is coalesced immediately before 
the event, then their position is uniformly distributed on the affected region and the left-most 
path will jump with displacement z\ — y and the right-most path jumps with displacement 
Z 2 — y where z\ < Z 2 are the (uniformly distributed) positions of the two potential parents of 
the event. If the pair ( L n , R n ) is separated, then only one of them will be affected by any given 
event. Selective events fall with intensity 

n l ' 2 dx <g> ns n dt <8> fi n (dr). 

We define a random walk (D n ~,D n,+ ), whose jumps are driven by a Poisson Point Process with 
intensity 

ns n dt <g> 2 ry(dr). 

At an event (t,r), D n, ~ jumps with displacement (Z\ — Y)/y/n and D n,+ jumps with dis¬ 
placement {Z 2 — Y)/y/n where Z\ = min{I/i, U 2 } and Z 2 = max{I/i, U 2 } with U\, U 2 and Y 
independent uniformly distributed random variables on [0,2r], 


Lemma 4.11 As n —»• oo, V n converges weakly to where B is a standard Brownian motion 
and 



4 

9 



r 3 /i(dr). 


Proof: Evidently J\ has mean zero and, conditional on r, its variance is 4r 2 times the variance 
of the minimum of two independent uniform random variables on [0,1]. Thus, conditional on r, 
the variance of J\ is 2?’ 2 /9. The lemma now follows from the Functional Central Limit Theorem 
(see, for example, Ethier and Kurtz (1986), Section 7.1). ■ 

Now consider D n A. 
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Lemma 4.12 Let T > 0. 

process s (->■ (— (s, (s ) where 


As n — > oo, ( D n ’ ,D n,+ ) converges weakly to the deterministic 




r 2 /j(dr). 


Proof: Since these walks experience jumps of size 0(1/\fn) at rate 

rTZ /* 7 Z 

2ns n / rn(dr) = 2 ay/n / rp,(dr), (4.9) 

Jo Jo 


which is proportional to gn, we see that we have a strong law rescaling. In the notation above, 
conditional on r, E [^2 — 1"] = —E[Zi — Y] = |. By the law of large numbers as n —> oo, 
(D n ~, D n,+ ) converges weakly to the deterministic process s e->• (—(s,(s), with ( as in the 
statement of the lemma. ■ 


When ( L n , R n ) is coalesced, its jumps have the same distribution as (V n + D n ~, V n + D n,+ ). 
When (L n , R n ) is separated, its jumps have the same distribution as (V n ’ 1 +D n,l, ~, V n,r +D n,r ’ + ) 
where V n ’ 1 , V n ’ r are independent copies of V n and D n,l ~, D n,r,+ are independent and with the 
same distribution as D n ~, D n,+ respectively. When L n and R n are nearby the evolution is more 
complicated; in fact in this case the joint jump distribution depends on \L n — R n \. Happily, 
because of Lemma 4.15 (see below), we will not need to describe the evolution in this case 
explicitly and we will denote it simply by (AfJ 1 ’ 1 ,J\fs ,r ). 

Since ( L n , R n ) is always in exactly one of the states ‘coalesced’, ‘nearby’, and ‘separated’, and 
using spatial and temporal homogeneity of n n , it follows from the above that we can represent 
the dynamics of (L n , R n ) in terms of three independent copies of the triple ( V n , H n=t ) which we 
denote (V n ' a , D n ' a,± ) with a G { c,l,r }: 


rn rn . t rn,l . p,n,Z,— . \rn,l . -srn,c . jwn,c 

-L^f) 1 V o n | i-J On 1 l\Tn 1 V fin I J-/fn 5 

s ‘-'s 1 v s w s w s 

(4.10) 

nn _ nn . T/ n,r n n,r,+ » rn,r , T/ n,c . n n,c,+ 

JJ t s — -TLQ V gn ~r -LJ gn ~T J V jy n I V fn 1 U fn , 

(4.11) 

s = C? + N? + S», 

(4.12) 

0= f S t{R n u >L n u }dCZ. 

■Jo 

(4.13) 


Of course the ‘clocks’ ( C n , 5”, N n ) are coupled with the random walks V n,a and D n,a A. 

Our next task is to prove that the time spent in the ‘nearby’ state is negligible. We require 
two preliminary estimates on the time between changes of state. The first says that each visit 
to the nearby state lasts at most 0(l/n) units of time. The second estimate says that visits 
to the coalesced state last 0(1 /\/n) units of time. This will limit the possible number of such 
visits in the time interval [0, T] to be 0(^/n) and since, moreover, the number of visits to the 
nearby state before the pair visits the coalesced state is 0(1), this in turn allows us to control 
the number of visits to the nearby state. 


Lemma 4.13 Let k G N and let r' k , he the next state change after r k ’ N . Then the random 
variables (t^ — r k " N )are an independent sequence and there exists A G (0, oo), not dependent 
on k, such that 



n.N 

T k 


A 

n 


Further, there exists q > 0, not dependent on k, such that the probability that ( L n ,R n ) is 
coalesced at r k , is greater than q. 
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PROOF: Independence is clear since the jumps determining the distinct tL — r k ’ N are driven 
by disjoint collections of events of IP. If ( L n , R n ) are nearby they must either be at a distance 
smaller than ^IZ/y/n, or at a distance between |7 Z/y/n and 2'JZ/y/n. In the first scenario, the 
probability that they coalesce through the the next event that affects either of them is bounded 
away from 0. In the second scenario, the probability that the next event that affects either of 
them brings them closer than |7 Z/y/n is also bounded away from 0. This guarantees that the 
probability that ( L n ,R n ) is coalesced at r' k , is bounded below by some q > 0. On the other 
hand, if the walkers are at a distance in (|7£/ y/n,2TZ/yjn), the probability that they separate 
at the next step is strictly positive. Thus the number of jumps until they either coalesce of 
separate has finite mean and since events affect them at rate 0{n) the result follows. ■ 


Lemma 4.14 Let k £ N and let be the next state change after T k ’ C . Then the random 
variables (rf,, — r( ! ’ )* ;e pj are an i.i.d. sequence and there exists A' 6 (0,oo), not dependent on 
k, such that 




Proof: This is trivial, since any jump out of the coalesced state is due to a selective event 
and the rate at which these occur is given by (4.9). ■ 


Lemma 4.15 Fix T > 0 and let Nf denote the total time spent in the nearby state up to time 
T. Then Ntf —>■ 0 in probability as n —>■ oo. 


Proof: The idea is simple. Since Ct < T, using Lemma 4.14, the number of visits to the 


coalesced state in [0, T] has mean at most Ty/n/A 1 . But by Lemma 4.13 the expected number 
of visits to the coalesced state is at least q times the expected number of visits to the nearby 
state. Thus the expected number of visits to the nearby state is at most Ty/n/(qA 1 ) and since, 
again by Lemma 4.13, each has expected duration at most A/n, E[IVy] < T A/(jqA' y/n) and the 
result is proved. ■ 


Since L n and R n evolve as V n + D n ~ and V n + D n,+ respectively, both converge individually 
and so their joint law is tight. Moreover, since C n , N n and S n are continuous increasing 
processes, with rate of increase bounded by one, their joint law is also tight. Evidently we now 
have that 


(£?, K, K’ 1 , K n ’ r , V s n ’ c , D? l ’-,D? r ’+, Dg ,c, ~, D”’ c,+ , Cg, IV™ , Sf ) s >o 


is tight and by passing to a subsequence we may assume that it converges weakly to some 
limiting process 

(L S ,R S , HB l s , fBl, fBf -(s, Cs, -Cs, Cs, C s , 0, 5 S ) S > 0 , 


where B , B r and B c are independent (by construction). Here, Lemma 4.15 gives that IV™ —> 0 


By Skorohod’s Representation Theorem, by passing to a further subsequence if necessary, we 
can assume that the convergence is almost sure. We claim that the limit (L s , R S ,C S , S s ) s > o then 


satsifies (4.2 4.5). Indeed, letting n —> oo in (4.10), (4.11) and (4.12) we obtain precisely (4.2), 


(4.3) and (4.4). Note that here the term A/^’J vanishes as an easy consequence of Nf —> 0. 


Obtaining (4.5) from (4.13) requires a little more work (because the function x Ija; > 0} 


is not continuous), but we need only adapt the approach of Sun and Swart (2008). For each 
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5 > 0 let ps be a continuous non-decreasing function such that ps(u) = 0 for u E [0, <5] and 
Ps(u) = 1 for u E [5, oo). Using (4.13) we have 


0 = r t{R n s > L n s }dc™ = r t{ir e -L n Q > m 


io 


/ o 


1/2 


re 


dC- + ^ T l|^-L-E 


1/2 




>/ ps(Rs ~ L™)dCg > 0 , 

Jo 


provided that 6 > 2 R/y/n. For such n we thus have J Q T ps{Rf — L^dC™ = 0 and letting n —> oo 
we obtai n ps(R s — L s )dC s = 0 for all <5 > 0. Letting 5 —»• 0 we obtain J Q T l{i? s > L s }dC s = 0 
which is (4.5). 


This completes the proof of Proposition 4.9 


Remark 4.16 An entirely analogous proof gives convergence of a pair of backwards right and 


left-most paths to left/right Brownian motions. In view of Remark f.8, this convergence occurs 
jointly with convergence of their first meeting time. That the constants £ and £ are unchanged 


follows from Lemma f.6 


5 Spaces of cadlag paths 

In this section, we construct the space /C(M), which is the cadlag path equivalent of the state 


space introduced by Fontes et al. (2004) for the Brownian web (and later used in Sun and Swart 


(2008) for the net). 


5.1 Skorohod paths with different domains 

We begin by studying the space 

G = {g : [a g , 2] — > [— 1,1] ; g is cadlag, a g E [—1,1], g is constant on [1, 2]} . 

We wish to treat G as a space of paths with a Skorohod-like topology, but since paths in G can 
have different domains, we must extend the usual approach. We refer to Chapter 3, Section 12 


of Billingsley (1995) and Chapter 3, Section 5 of Ethier and Kurtz (1986), upon which our 


arguments are heavily based, for the standard theory of the Skorohod topology. 

For g,h E G, let A '\g, h\ denote the set of strictly increasing bijections from [a g , 2] —> [ah, 2], 
We define A [g, h ] to be the subset of A E A '[g, h] for which 


7 g,hW = sup 


log 


A(s) - A(t) 


s — t 


< oo. 


For such g,h, A we define 


and 


d{g, h, A) = sup \g(t) - h(X(t))\. 

te[cr s ,2] 


P(g, h) = inf ( 7 fl ,h(A) V d(g, h, A)). (5.1) 

Our main aim in this subsection is to show that G is a complete and separable metric space 
under the metric 

d(g,h) = p(g,h)W \a g - a h \ 


23 
























Intuitively, this says that paths in G converge if their domains converge and, as the domains 
become close, the paths also become close (in the Skorohod sense). We take a g E [—1,1] and the 
domain of g E G to be [a g , 2] for technical reasons: if instead we took the domain [a g , 1], A '[g, h] 
would be empty whenever < a g = 1. For s E [— 1,1], we write G[s] = {g £ G : cr g = s}. 

Remark 5.1 For s E [-1,1], G[s] is precisely the space of cadlag paths mapping [s, 2] -» [—1,1] 
that are constant on [1,2]. Moreover, on G[s], p coincides with the usual Skorohod metric. 


Lemma 5.2 The space (G, d) is a metric space. 


Proof: 
have A -1 


If d(g, h) = 0 then a g = 
E A [h,g\. Since 


ah, so by Remark 5.1 we have g 


h. For any A E A[< 7 , h\, we 


7 g,h{X) = sup 

(Jg<t<S<2 


log 


A(s) - A(t) 


s — t 


sup 

Oh<t<s <2 


log 


s — t 


A-i(a)-A-i(t) 


= 7 h,g(X ) 


and, similarly, d(g , h, A) = d(h,g, A^ 1 ), we have that d is symmetric. 

It remains to prove that d satisfies the triangle inequality, for which it suffices to show that 
the triangle inequality holds for p. To see this, take f,g,h€ G. For Ai E A[/, g] and A 2 E A[< 7 , h] 
we have A 2 o Ai E A[/, h\ and 


7 /,a (^2 0 Ai) = sup 

(Tf<t<S< 2 

< sup 


log 


log 


(A 2 q Ai )(s) - (A 2 q Ai)(t) Ai(s) - Ai(t) 
Ai(s) - Ai(f) 

A 2 (s) - A 2 (t) 


a g <t<s<2 S t 

= +7/,s(Al). 


+ sup 

<7 f<t<S< 2 


log 


s — t 
Ai(s) - Ai(f) 


s — t 


(5.2) 


Similarly, 


d(f,h,X 1 o A 2 ) = sup \f(t) - h{X 2 {Xi(t)))\ 
te[(Tf, 2 ] 

< sup \f(t) - g(Xi(t))\ + sup \g(t) - h(X 2 (t))\ 
te[af, 2] te[cr 9 ,1] 


= d{f,g,Xi) + d{g,h,X 2 ). 


Combining (5.2) and (5.3) we have that p(f,h ) < p(f,g ) + p(g,h), as required. 


(5.3) 


Lemma 5.3 The space (G, d) is separable. 

PROOF: Let g E G and suppose a g E (—1,1). Let {qf) be an increasing sequence in Qn (—1,1) 
such that qi f a g and, for each i, define A i : [^,2] — > [a g ,2] by setting A*(%) = a g , Aj(l) = 1, 
Aj(2) = 2 and taking A* to be linear on [qi, 1] and on [1, 2], Define gi E G[qi] by gift) = g(Xi(t)). 
Then A ^ 1 E A [g,gi] and 


7 9,9i(\ ) — 


log 


1 - qi 


1 - a n 


d(g,gi,Xi ) = 0 


Hence d(g,gi) —> 0 as i —> 00 . By Remark 5.1 for each q E Q D [—1,1] the space (G[q],p) is 
separable, hence (G, d) is separable. ■ 
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Before we address completeness, we recall that the Skorohod topology is often characterized 
using a metric with respect to which it is not complete; this characterization is useful primarily 
because it is easier to work with. The extension to G is as follows. 

For g,h € G and A E A '[g, h ] define 7 ^ h (X) = sup te [ CTg 2 ] | A(t) — t\. Then, let 

p'(g, h) = inf ( 7 ^ h (A) V d(g, h, A)) 

AeA'[g+] 

and define d'(g,h ) = \a g — o k | V p'(g,h). It can be checked, in similar style to the proof of 
Lemma [5~2| that ( G,d') is a metric space. 

Lemma 5.4 The metrics d and d' generate the same topology on G. 


Proof: First note that A [g,h\ C A '[g,h] and, since \x — 1| < eI log ^I — 1 for all x > 0, for 
A E A [g,h\ we have 


T , g,hW= SU P I* “0's I 

te(<rg, 2 ] 


A(t) - (J g 


t~ On 


- 1 


< SUP | t — <Jg\ 
te(cr g ,2] 


A(t) - a (a g ) 


t - (7n 


~ 1 


+ Wh — a q 


< 3 -1 + 1 a h - ag^j . (5.4) 


(We have used that A(ct 9 ) = 07 , and the continuity of A at a g .) 


Let ( g n ) C G and g E G. If d(g n ,g) —> 0 then it follows readily from (5.4) and the definitions 


that d'(g n ,g) —» 0. It remains to prove the converse; suppose instead that d'(g n ,g) —> 0. 
Fix JVeN. Since d'(g n ,g) —> 0 there exists a sequence E A [g n ,g\ such that 


7 g„, 9 (A„ ) V d(g n ,g, A”) V \<r gn - a g \ -> 0 


as n —> 00 . Define = a g and for k = 1,2,... define 


r k = 2 V inf { t > r fc _ x ; \g{t) - g(r k )| > — 


(5.5) 


(5.6) 


up until the first k = kjq for which r k = 2 . Since g is cadlag, (r k )^!X 0 is a finite, strictly 
increasing sequence, and = 2 . 

For each n E N, define p^ to be the unique piecewise linear function for which 

) = (A^-Vf) (5.7) 

for all k = 0,..., fcjv (and is linear in between those points). Then, p% E A '[g, g n ] and, moreover, 


~h,tn(l‘n)= SU P 


log 


r N _iV 


< 00 


so that p* E A [g,g n \. In fact, since (5.5) implies that lim n _^oo A„ (7+ ) = r k , we have 
7 g,g n (pn) — > 0 as n — > 00. Further, 

sup \g n {t) -g ((^) _ 1 (t))| 

te+ 9 n , 2 ] 

< sup \g n (t) -g()£(t))\ + sup \g (t)) - g ((p^y\t)) \ 

te[a gn , 2] te[a gn ,2] 

<d(g n ,g,\n)+ SU P (A^ o/r^(t))-c/(f)| 

*e[tr g ,2] 

2 

< d(g n ,9, ^n) + 
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Here, the final line follows from (5.6) and (5.7). Hence, recalling that d(g, g n , g% ) = d(g, g n , (g% ) _1 ) 
we have d(g , g n , g%) — > 0 as n —> oo. 

Combining the above with (5.5), we can choose a strictly increasing sequence (nAr)iVeN of 
natural numbers such that, for all n> njy, 


1 


7 g,g n (M n ) — jy- > 


d(g,g n ,g n ) < jj, 


. l 

(J n — O'n\ ^ -• 

\ 9n 91 ~ N 


Define K, n = g^ for all n E N such that n n < n < njv+i- Then d(g n , <?) < 7 9lffn (ac n )Vd(g, g n , « n )V 
ksn “ I so d(g n ,g) -> 0 as n -> oo. ■ 

The space (G, d') is not complete (to see this, note first that by Remark 5.1 and Example 12.2 


of Billingsley (1995), even the space (G[s],d') is not complete). In order to prove completeness 
of (G, d), it will be useful to note that there exists e* > 0 such that for all x e [0, e*), we have 


e x — 1 < 2x. 


(5.8) 


Lemma 5.5 The space ( G , d) is complete. 


PROOF: It suffices to show that any Cauchy sequence in ( G,d ) has a convergent subsequence. 
To this end, let (gk) be a Cauchy sequence in ( G,d ). Thus o 9k is Cauchy, which implies that 
cTg k —A a for some a 6 [—1,1]. 

With mild abuse of notation, we pass to a subsequence (g^) such that for all j > k we have 
d(g k ,gj) < 2 _fc e~ fc ~ 1 . Hence, there exists X k E A[g k ,g k +i] such that, for all k, 

Tg^+i^fc) V d(g k ,g k+ i,X k ) V \o 9k - a\ < 2“ fc A e*. (5.9) 

For each k, define : [—1,2] —> [—1,2] to be the function that is equal to on [o gk ,2] 
and, if a 9k > —1, is linear in between A*,(—1) = —1 and \k{(J 9k ) = cr 9k+1 . Note that this means 
Afc has constant gradient on [— l,o 9k \, and that X k is a continuous bijection of [—1,2] to itself. 
Thus, we have 


sup 

—l<t<2 


X k (t) - t 


< sup |Afc(t) — t 


< 3 (eWNd _ i + | o 9k+1 - 

< 3(2" fe+1 + 2~ k ) = 9 • 2~ k . 


(5.10) 


Here, the second line follows from ( |5.4[ ), and the final line from (5.8) and (5.9). 

We now construct the limit of {g k ). Define g% : [<r 9k , 2] —> [cr 9 , 2] and g% : [—1, 2] — > [—1, 2] 
by gl = X k+n o ... o X k+ i o X k , and g% = X k+n o ... o A fc+ i o A fc . By (|5.10D we have 


sup \g k +1 (t) - g k (t) | < sup \X k+n+ i(t) - t\ < 9 • 2 k n . 
te[- 1 , 2 ] [— 1 , 2 ] 

It follows that (g^)^ =1 is a Cauchy sequence in (C[_i i2 ] [—1,1], | [ • ||oo), an d hence has a limit, 
which we denote by g k . Since the g 1 ^ are increasing, it is immediate that g k (s) > g k (t) for s > t. 
We define g k ■ [<r Sfc ,2] -» [—1,1] by g k {t) = g k (t). Note that 


Tk{vg k ) = Ihn X k+n o...o X k (a 9k ) = lim a 9k = a, 

n—t oo n—>oo 

g k (2) = lim X k+n o ... o A fc (2) = 2, 

n —^oo 
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(5.11) 

(5.12) 














and also that, from (5.10), 


k+n 


|/Xfe(l) — 1| = lim 1 ^( 1 ) — 1 | < lim sup |A 7 (f) — t\ < 9 • 2 k+2 . 
n ^°° n ^°°f^ k te[a gk ,2] 


In similar style to (5.2), 


sup 

< Tg k <S<t <2 


log 




s — t 


< 


sup 


k+n 

V 

j=k <y 3j <s<t<2 

k+n 

Tgj.gj+i 

j=k 


log 


Aj(s) Aj(t) 


s — t 


< 2 


— fc+1 


Here, to deduce the final line we use (5.9). Letting n —>■ oo we have 

Atfc(s) - Wfc(i) 


sup 


log- 


s — t, 


< 2 


—fe+i 


(5.13) 


(5.14) 


(5.15) 


Consequently, g k is strictly increasing. Thus from (5.11) and (5.12), we have that fi k '■ [<r Sfc , 2] 
[ct, 2] is a strictly increasing bijection. In particular, it has an inverse /7) 1 : [a, 2] -+ [<r Sfc ,2]. 
The proof now follows the usual strategy. For each k, we define z k : [—1,2] —> [—1, 2] by 


Zk(t) = 


9k o n k (t) t € [a, 2] 

0 t E [—1, a). 


We have 


sup l^fc(i) - 2 fe+ i(t)| = sup |^ (/x*. 1 ^)) - 9k+i (Afc^fc 1 ^)))! 
iS [—1,2] iS[—1,2] 

= sup \g k (t) - g k+ 1 (A fc (t))| 

d(Qk') 9k+h Afe) 

< 2~ k 


Here, the second line follows by definition of g k and the final line follows by (5.9). Thus, by 
completeness of M, there exists a function z : [—1,2] —> [—1,1] such that 


sup \z k {t) — z(t)\ —> 0 
iS [—1,2] 


(5.16) 


as k —> oo. Since each z k is cadlag, 2 is cadlag. By (5.13) and the fact that each g k is constant 
on [1, 2], it follows from (5.16) that z is constant on (1, 2], hence by right continuity z is constant 
on [1,2]. 

We define g by setting a g = a and g(t) = z(t) on [<t 9 ,2], and note that g E G. We have 
shown above that g k is a strictly increasing bijection, hence g k E A [g k ,g\- Thus, from (5.16) we 
have 

sup \g k {t) - g(nk(t))\ = sup Igk+Z 1 ^)) - g(t)\ -+ 0 as k -+ oo. 


iS[(Tg, ,2] 


te[cr g ,2] 


Combining the above equation with (5.15), 7 gk,g(l JL k)V d(g k , g, g k ) —> 0 and since a g = lim/ c _ 5 . 00 a gk 
we have g k -+ g in (G, d). ■ 
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5.2 The space (M, g?m) 


Recall the space M from (3.1) and the notation Kt = tanh 1 (t). It will sometimes be useful to 


write n(t) = Kt- Each / E M corresponds to some / E G, essentially through the relation (3.2), 
that is 

tanh(/(«t)) 


m = 


(5.17) 


i + M 

for t E [k _ 1 ((T/), 1]. In order to treat / as an element of G we specify that additionally f(t) = 0 
for all t E [1,2]. Note that aj = n~ l (crf). In this section will use notation from Section 
without comment. 

The map />—)•/ naturally induces a pseudometric on M through the relation 


5.1 


d M {fij2)=d(hJ 2 ). 


(5.18) 

It follows immediately from Lemmas |5.2| and |5.3| that the set of equivalence classes of M, under 
cLm, form a separable metric space. Note that it is necessary to use equivalence classes, since all 
/ E .D[oo] map to the same / £ G. From now on we abuse notation slightly and write (M, <Im) 


for the metric space of equivalence classes. This defines the metric cLm that appeared in (3.3). 


Lemma 5.6 The space (M, d^i) is complete. 

Proof: Let (f k ) be a Cauchy sequence in (M, djv/). Then (f).) is a Cauchy sequence in ( G,d ) 
and by Lemma |5.5| there exists g £ G such that fa g. It remains to show that there exists 


/ £ M such that / = g, which will in turn follow immediately from (5.17) if we can show that 


lfl(*)l < 


1 


1 + \i^t\ 


for all t £ [a g , 1]. 

Equation ( 5. 19[ ) is readily seen; note that, by Lemma 5.4, fa 


exists A fc £ A [g,fa] such that l' g j k {^k) V d(g,fa,X k ) V \a g - aj k \ 
lirn^oo f k {Xk(t)). By ( |5.17| ) we have |/ fc (s)| < for all s, hence 


(5.19) 


► g implies that there 
0. Therefore, g(t) = 


Ifl'(i)! < limsup 


1 


k— xx) 1 T |^(Afc(t))| 


(5.20) 


We also have l' g j k {Xk) = sup tg [ CTg>1 ] \\ k (t) —1| —>• 0. Combining this with (5.20) proves (5.19). 


Remark 5.7 Note that d' M (fa, f 2 ) = d'(fa,f 2 ) is a pseudo-metric on M, with the same equiv¬ 
alence classes as dM■ Hence, by Lemma 5-4 d' M generates the same topology on M as dM- 


If we look at the subset M of M consisting only of continuous functions, with the metric 

d^(/i,/ 2 ) = 1^-oy 2 | V sup \fa{t Vct/J - f 2 (t Vct /2 )| (5.21) 

t6[-l,l] 


then we recover the space of continuous paths introduced by Fontes et al. (2004) (with a minor 
modification relating to the values of functions at {— 00 , 00 }, see the appendix of Sun and Swart 


2008 for details). 

We now establish the natural relationship between M and M, which mirrors the ‘usual’ 
continuous embedding of spaces of continuous paths (with the || • ||oo metric) into Skorohod 
spaces. Recall that /C(M) (resp. IC(M)) denotes the space of all compact subsets of M (resp.M). 
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Lemma 5.8 The space M is continuously embedded in M. Moreover, fC(M) is continuously 
embedded in JC(M). 


Proof: Note that the first statement follows immediately from the second, so we will prove 
only the second statement. Recall that the topology generated by the Hausdorff metric (on 
JC(M)) depends only on the underlying topology (of M), and not on the underlying metric. In 


view of this fact and Remark 5.7, for the duration of this proof we take the Hausdorff metric on 
/C(M) as that generated by ( M,d ' M ). 

Let W n ,W be subsets of /C(M) such that W n —> W in /C(M). By Lemma 


A.l 


the set 


W = W U (UneN Wn) is a compact subset of M. A characterization of relative compactness 


in M is given in the proof of Lemma 4.6 of Sun and Swart (2008), based on the Ascoli-Arzela 
Theorem (or, for a more detailed treatment, see the appendix o f|Schertzer et ah (2014)). It 
follows immediately from this characterization that the set W = {f ; f £ "W} is equicontinuous. 
Let e > 0. By equicontinuity, there exists <5 > 0 such that |s — t\ < 5 implies 


sup | f(s) - f{t) | < e. 
/e# 


(5.22) 


Without loss of generality we may choose 5 E (0, e). By definition of the Hausdorff metric, 
choose N such that for all n > N, 


sup inf dj^(g, h) < 5 and sup _ inf d^(g,h) < 5. 


geW, 


hew 


gew 


hew n 


m ' 


(5.23) 


By the first equality of (5.23), for any g E W n and n> N, there exists h E W such that 


|Og — aj) < 5 and 


sup \g(t) - h(t)\ < 5. 

t£[a s Va n , 2 ] 


(5.24) 


Define A g : [a g , 2 ] —> [cr^,2] by setting A(l) = 1, A( 2 ) = 2 and linear in between. 

Thus A g E A '[g,h]. For t E [<Jg, 2], we have 1 1 — A s (i)| < \crg — a^\ < 5. This implies that 
7 g(Ag) < e and, using (5.22) and ( |5.24 ), that 


\g(t) - h(X s (t))\ < | g(t) - g(Xg(t))\ + \g(\ s (t)) - h(\ s (t))\ < 2e. 


Thus, d'(g,h) < 2e. Similarly, using the second equality of (5.23), for any g E W and n > N, 
there exists h E W n such that d'(g, h) < 2e. We thus have, for all n > N, 


max sup inf d'(g,h), sup inf d'(g,h ) < 2 e. 

\gew„ h ^ w g£ w h ^ w n I 


(5.25) 


Hence, W n —y W in /C(M) as n -> oo. ■ 

In the interests of brevity, we limit our further development of the space (M, dM) to the 
following two results. 


Lemma 


5.9 Let f,g E M with a f = a g and sup te[(T/;00 ] | f(t) - g(t)\ < r. Then d' M (f,g ) < r. 


Proof: Since Of = a g , the identity function l is an element of A [f,g\. Note that 7 ?(t) = 0. 
Hence d' M (f, g) < sup te[tr/i0o] | f(t) - g(t)\ < sup te[fT/j2] \f(t) - g(t)\ < r, as required. ■ 


Lemma 5.10 Let ( f m ) C M and f E M with af m = a j 6 (— 00 , 00 ). Then dM(fm,f ) —>■ 0 if 
(the restrictions of) f m —tfin y](R) for all T E (af, 00 ). 
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PROOF: Note that tanh : [— 00 , 00 ] —> [—1,1] is a contraction. For T E (0, 00 ), set T = tanh(T), 
and note that tanh restricted to [— T, T] —> [— tanh(T), tanh(T)] is bi-Lipschitz. Hence there are 
constants Ct €E ( 0 , 00 ) such that 


sup 

—T<t<s<T 


K s ~ K t 

< Ct, 

sup 

s — t 

s — t 



—oo<s<t<oo 

K s - Kt 


< 1 . 


(5.26) 


Let e > 0. Let T E (07 V 0, 00 ) be such that — e ■ Thus, 

sup sup \q(t)\ < e. 
g&M te[TVa g ,2] 


(5.27) 


By Theorem 12.1 of Billingsley (1995) there exists M E N such that for all m> M there exists 
a continuous strictly increasing X m : [ 07 , T] —> [ 07 , T] with 


sup \t-X m (t)\< —, 
te[oy,T] 


sup \f(t) — fm(X m (t))\ < e. 
tep/,T] 


(5.28) 


Define X m : [ 07 , T] —> [ 07 , T} by A m (t) = k 1 oA m oK(t), and note that by (5.28) for all t € [oy, T], 


and, by the right hand side of (5.26), 


l/(*) - /m(A m (t))| = 


t - A m (f)| < C T \t - A m (f)| < e, 

| tanh(/(«*)) - tanh(/ m (/t(A m (t)))| 


(5.29) 


1 


1 + M 
< |/(«t) - /m(A m («t))| < e 


(5.30) 


Extend X m to rj m : [( 77 , 2 ] — > [aj, 2] by setting X m (t) = t for t > T. Then, combining (5.27), 
(5.29) and (5.30) we obtain 7 ^(A m ) < e and d!{f. f m , X m ) < 2e. It follows that <f(/, / m ) —>■ 0, so 
the stated result now follows by Lemma |5.4| ■ 


6 Convergence to the Brownian Net 


6.1 Compactness 


In order to use Theorem 3.4 we must verify that our various set of paths really are subsets of 


/C(M). That is, we need to show that they are compact subsets of M, which is the content of 
this subsection. We concentrate on forwards paths; analogous arguments apply to backwards 
paths. 

We require three preparatory lemmas. The first two of these embody the key features of the 
argument; at any given time, within bounded intervals of space, the number of ancestral lines 
at distinct spatial locations is finite, and there do not exist ancestral lines that move arbitrarily 
fast across space. 


Lemma 6.1 Let n E N. Then, almost surely, for all (random) a,b,t E M the set E 0 5 1 = 
[a, b} n {/(f); / E vl{V n )} is finite. 


PROOF: For fixed deterministic a,b,t, it is easily seen that the set of (y,s) for which s < t, 
y E [a, b] and the line {y} x [s, t] is not affected by any reproduction events, is almost surely 
bounded (in M 2 ). Thus, since T> n is locally finite, the set E 0 5 1 is almost surely finite. Take 
countably dense (deterministic) sequences (a m ). (b m ) and (t m ) in M; thus almost surely, for all 
mi, m 2 , m 3 E N for which a mi < b m2 , the set E amiit is finite. The stated result now follows 
from Lemma l4.ll ■ 
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Lemma 6.2 Let n E N. Almost surely, there does not exist a (random) sequence (x m ,t m ,r m )m=lC 
n n such that sup m \t m \ < oo, \\m rn ^. 00 x rn = oo and sup m — x m \ < A1Z n . 


PROOF: Let K E (0,oo). Then, the probability that 


n n n [\AkTZ n ,A(k + l)7£ n ] x [~K,K\ x [0,7£ n ]) = 0 


is positive and does not depend on k. Consequently, the probability that there exists a se¬ 
quence (x m ,t m ,r m )™ =1 C II n such that lim nwoo x m = oo and sup m \x m+ i - x m \ < 4K n , with 
sup m \t m \ < K, is zero. Since K was arbitrary, the result follows. ■ 


Recall that our ultimate goal is to prove Theorem |3.5| , which claims convergence in distri¬ 
bution. In view of this, from now on we will (abuse notation slightly and) assume that the 
conclusions of Lemma 6.2 and 6.1 hold surely. 

The next lemma asserts that any convergent sequence of paths that becomes close, in space, 
to touching oo within some bounded interval of time, must converge to a constant path at oo. 


Lemma 6.3 Let n E N. Let ( f m )m=i ^ Pn(^n) be a sequence of paths and suppose <x(/ m ) 
converges to v E [— 00 , 00 ]. Suppose also that there exists a bounded sequence ( t m ) with t m > 
a(f m ) for which f m [t m ) —> 00 as m —»• 00 . 

Lot /oo £ M be the path defined by u(/ 00 ) = v and foo(s) = 00 for all s E [u,oo]. Then 

fm /oo in M. _ _ 

Moreover, if instead (/ m ) C Vn(V n ), then under the same hypothesis f m —> /^ m M. 


Proof: Fix K E (0, 00 ), large enough that sup m \ t m \ < K. Define x(m*, K ) = inf{/ m (s); m. > 
m*,a(f m ) < s, |s| < K}. 

Suppose, for a contradiction, that x(m*,K ) does not tend to 00 as rn* —> 00 . Then, there 
exists X E (— 00 , 00 ) and infinitely many m* for which x(m *, K ) < X. For all such m* we have 
some m > m* and |s| < K such that f m {s) < X, and (by our hypothesis) as m* ->oowe have 


6.2 


also fm (pm) ~> 00 ; since f m E V^(V n ) this is a contradiction to Lemma 

So, x(m*,K ) — »• 00 as m* -» 00. Thus, for any K,X E (—00,00) we can find rn* E N such 
that, for all m > m* and s > cr(f m ) such that |s| < K, we have f m (s) > X. With this in hand, 
the stated results follow easily from (5.18) and (5.21). ■ 


Recall that, by (4.1), the path /^ in the statement of Lemma 6.3 is an element of both 
P^(D n ) and V^(T> n ). Recall also that, in the notation of ( |4.1[ ), both these sets also contain 
paths that are infinite extenders. 


Lemma 6.4 Let * E {t,i}, t £ {Z,r} and n E N. Then, V*((D n ) and Vn\v n ) are compact 
subsets of M, also V*(D n ) and Vn\v n ) are compact subsets of M. 


PROOF: As usual, it suffices to consider the case of forward paths. Since Lemma |6.3[ on which 
the following argument relies, holds in both M and M, it will suffice to consider only cases in M. 
Moreover, the arguments required the case of Vn^(V n ) are essentially identical to those required 
for the case Vn((D n ): thus, we aim to show that T\{V n ) is a sequentially compact subset of (the 
metric space) M. 

Let (/ m )m =1 — L > n{T> n ) be a sequence of paths. We must show that (f m ) has a convergent 
subsequence, with limit in v\{T> n ). 

We now split into several cases. 


31 










1. If (a(/ m )) m >i has a subsequence that converges to oo then, along this subsequence, f m 
converges to the degenerate path / with erj = oo and /(oo) = 0 . 

2. If {u(f m ))m >i has a bounded subsequence, then consider the sequence x m = 

(a) If (x m )m >l is bounded, then since V n is locally finite there must be a subsequence 
along which (cr(f m ),x m ) is eventually constant. Any given ancestral line moves to 
one of at most two locations in a reproduction event, thus (f m )m >l has a convergent 
subsequence; to construct the limit path we successively follow parent points that 
were followed by infinitely many of our f m . 

(b) If (x m )m>i is not bounded, then without loss of generality we pass to a subsequence 
and assume that both x m —» oo and a(f m ) converges. It then follows immediately 
from Lemma |6.3| that f m converges along this subsequence. 

3. If (o-(/ m )) m >i has a subsequence that converges to —oo, then pass to that subsequence 
and set t = sup m a(f m ) < oo. 


(a) If {m ; < K} is finite for all K < oo, then essentially the same argument as 

in 2(b), reliant on Lemma 6.3, shows that f m has a convergent subsequence. 

(b) If, for some K < oo the set {m ; \f m (t)\ < K} is infinite, then pass to the subsequence 
of f m such that |/ m (t)| < K. By Lemma 6.1, the set [—K,K] n {f m (t) ; m £ N} is 
finite. Hence, there is some \z\ < K through which infinitely many f m pass. Using 
the same method as in 2(a), we can construct a path / : [t, oo] with f(t) = z that 
is followed by infinitely many f m . So, pass to a further subsequence and assume 
fm{s ) = f(s) for all s>t. 

We extend / backwards in time as follows. From location ( z,t ), look backwards in 
time until the most recent reproduction event (strictly) before t that affected z, say 
p = (x,t',r). By Lemma 6.1 the set ); p affects f m ,m G N} is finite. Pick 

some element z' of this set, and restrict to f m for which = z!. Set f(s ) = z 

for s £ [t',t) set f(s) = z. Then, look back from ( z',t'), and repeat (in the language 
of (4.1), / is an ‘infinite extender’). Thus, a subsequence of (f m ) m >i converges to /. 


Since (cr(fm))m>i must have a subsequence that converges in [— 00 , 00 ], at least one of the 
above cases occurs. This completes the proof. ■ 


6.2 Convergence of multiple left/right paths 


We now extend Proposition |4.9| to larger collections of left and right-most paths. Let N E N. 


O 


i = 1,... ,N} of distinct points in M 2 and a function 
N} —> {l,r}, Sun and Swart (2008), Section 2.2, construct a system of left-right 

(Recall that two left-most paths 
describes the interaction between 


Given a finite set D = {{yi,Si) 


coalescing Brownian motions started from the points of D 
coalesce on meeting, as do two right-most paths, and that 
left-most and right-most paths.) We write 


Rt (D,0) = {RbGusd ; i = 1,..., IV} 


( 6 . 1 ) 


for this system, where Lfi’W.si) denotes the path of a Brownian motion started from (y^, Sj) with 
diffusion constant f 2 and drift ( to the left if 0(i) = l and to the right if O(i) = r. 

For each i = 1,...,AT let df = (yf,sf) G M 2 be such that d) d t = (y*,Si) G M 2 . Set 
D <n ) = { d2 ; i = 1,..., IV}. We define the set v\{D^ n \0) = {/|,..., /]y} where fj is the 
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0(i)-most forward path from d” driven by events in FPs and Vn{D^ n \ O ) for the corresponding 
space of interpolated paths. Note that both these sets are random elements of the product space 
M n . 

Lemma 6.5 Let N e N and let D^ n \D, O be as above. Then, as n —* oo, V\{D^ n \ O) converges 
weakly in M N to V^{D, O ) and vX{D^ n \0) converges weakly in M N to V^{D, O). 


Proof: The argument is essentially identical to that of the proof of Proposition 5.2 of Sun 


and Swart 

(2008 

). The construction of V^{D,0) in Section 2.2 of 

Sun and Swart 

to 

o 

o 

00 


inductive construction that views V'{D, O ) as made up of several independent pieces consisting 
of segments of either single left-most paths, single right-most paths or a pair of left/right paths. 
The same inductive construction breaks down Vn(D^ n \ O ) into corresponding pieces. The stop¬ 
ping times used in this construction are continuous functionals on M N with respect to the law 
of independent e volut ions of path s wit hin each such piece, so the first part of the lemma follows 
from Proposition 4.9 and Lemma 5.10 Similarly, Vn{D^ n \ O ) converges weakly to V^{D, O ). ■ 


6.3 Proof of Theorem 13.51 


We complete the proof of Theorem 3.5 in three steps. Recall that the Brownian net is denoted 
by A f, and recall the function TLcross defined in Section |3.2 


Lemma 6.6 As n —> oo, we have that 

U cross (vl’\V n ) U P,t’ r (Pn)) -> N, 

in distribution in /C(M). 


PROOF: We verify the conditions of Theorem 3.4 

\V n ), where f £ {l,r}. By Lemma 


xi = vl’\v. 


and xl = Vr, 


Thi s theorem is applied with 
6.4 all these sets of paths are 


(almost surely) elements of /C(M) (after rotation by 180 degrees about (0,0) for the backwards 


l\v n )uvl’ r ( 

We now check the conditions in turn. For {&/), the required statements about non-crossing 


paths). We define X n = 'H C ross('Pn (Pn) U Vn (T> n )) and similarly for X 


paths are precisely the content of Lemma 4.4 For {38), the required convergence of multiple 


left/right paths to left/right Brownian motions is precisely the content of Lemma 6.5 
We now move on to {‘to) 


of 

Remark 


lVk r {T>. 


are respectively elements 
and 


4.10 


4.16 


If {x n! i,Xn t2 ) ->• (xi,x 2 ) and l n ,r n 
started at {x n ,i,x n p), then it follows by combining Lemmas 
that {In, fn) (i,r) in distribution, where (l,r) are a pair of left/right Brownian 
motions. That the first meeting time of l n with r n also converges (jointly) in distribution to the 
first meeting time of l with r follows from Remark 


4.8 


It remains to verify {&)■ By Lemma 4.4 left-most towards and left-most backwards paths 
cannot cross, and similarly for right-most paths. Therefore, a path of X n that enters a wedge W 
of X n from the outside must enter through the southern-most point of the wedge (i.e. precisely 
where the two paths f and l defining W meet, in Figure^. Fix a wedge W of X n and denote 
this event by E\y- 

For the event E\y to occur, some reproduction event must have a potential parent situated 
at the spatial location of the southern-most point of W. The distribution of the spatial location 
of a pre-parent (in a reproduction event occurring at given time) has no atoms, and thus the 
distribution of the spatial location of the meeting point of f and l also has no atoms; hence 
almost surely E\y does not occur. 
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The set X n contains countably many paths and thus has at most countably many wedges. 
Hence, almost surely the event E\y does not occur for any wedge W of X n . Without loss of 
generality, we may assume this does not occur surely , so as (3>) holds. ■ 


Lemma 6.7 As 


n 


oo, we have that Vn{V n ) tends in distribution to A f 


Proof: If a left-most and a right-most path of V}i(T> n ) cross, then the point at which they 


cross much be within one of the sets ^x(p)(p) (defined in Lemma 4.1) associated to a selective 
event p E n n . If an interpolated arrow finishes at p, then by definition there are interpolated 
arrows ending at both potential parents of p. Consequently, there is a one to one correspondence, 
/ i —> f between paths / E Id-cross ('Pn’ l ('D n ) U Vn ,r {V n ) \ and paths f E V^(D n ) such that 


<2Ur 


( 6 . 2 ) 


for all t > Gf = afi, and f(<7f) = 


6.4 


we have that Vn.(T> n ) is an element of /C(M). Combined with (6.2) this means 


By Lemma 

that Tn(V n ) has t he same limit (in distribution) as %, 
from Lemma 


{Vn l (V n )yjVn r (V n )) as n > oo. Thus, 


,T 


6.6 


V\(T> n ) tends in distribution to A f. 


To finish, we must upgrade the result of Lemma 
paths in place of in terpolated paths. 


6.7 


from /C(M) to /C(M), and use cadlag 


By Lemma 6.4 we have that Vn(T> n ) E JC(M ) for all n. Combining this fact with Lemma 


4.2 


and noting that ther e is a one to one correspondence between paths and interpolated paths, we 

that also vX{D n ) —> A f in distribution in /C(M). This completes the 


obtain from Lemma 


6.7 


proof of Theorem 3.5 


7 Simulations 


In all the work described in Section 3.3, lineages coalesce instantly on meeting and, in particular, 
they cannot ‘jump over’ one another. Our result also requires this property, which is achieved by 


setting v = 1. In the absence of selection, it is shown in Berestycki et al. (2013) that if instead 
we fix v E (0,1), the scaling limit of the paths relating a finite sample from the population is 
a system of coalescing Brownian motions, but with ‘clock’ rate v (so that the whole process is 
slowed down). In particular, we obtain a simple time-change of the limit for v = 1. 

It is natural to ask what happens when v < 1 in the presence of selection. Our method of 
proof certainly breaks down. In particular, we can no longer trace the left/right most paths 
starting from a single point in isolation: because paths can now cross, the current left-most path 
may not be affected by an event, whereas another path in the same region is, and if the parent 
(or one of the potential parents) of the event is to the left of the current left-most path, a new 
line of descent takes over as left-most. 

We have been unable to find a rigorous result in this context and so in this brief section, 
instead, we present the results of a numerical experiment. 

In order to approximate the limiting process, we simulated the system of branching-coalescing 
lineages (which was introduced in Section 2.3). In particular, we were interested in &(y), the 


expected position of the right-most ancestor at time 1 of a single particle, which starts at the 
origin at time 0. By symmetry the same analysis applies to the left-most particle. 

As a result of the discussion above, and the result of Berestycki et al. (2013), it is natural 


to ask if &(v) varies linearly with v. It seems that this is not the case, as is shown in the left 
hand plot of Figure [5] Note that this does not remove the possibility that, as n —> 0, the SAFVS 
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Figure 5: Right-most potential ancestor. The plot on the left shows ^(u), the average 
location of the right most ancestor at time 1 of an individual located at the origin at time 0. 
Here we take n = 1000 and fa(dr) = Si. The plot contains 200 data points, each corresponding 
to a value of v G (0,1], spaced evenly along the horizontal axis. Each data point is the mean 
of 2000 independent simulations with the corresponding value of v. The plot on the right is a 
logarithmic plot of the same data. 


rescales to a Brownian net, but it does imply that the speed of such a limiting net would not 
match the speed suggested by the simple time-change in the Brownian web limit of |Berestycki 
. As can be seen from the right hand plot of Figure [5j &(v) is also not of the form 

Simulating the SAFVS when v = 1 is a quite different task from simulating it when v is close 
to 0. In the former case, it is more efficient to generate reproduction events by simulating the 
underlying Poisson Point Process, whereas in the latter case it is more efficient to track clusters 
of particles that are close enough to be affected by the same event and simulate their (correlated) 
motion directly. Our simulation employs both methods of event sampling and alternates between 
them based on which method is asymptotically more efficient for the given value of v. 

In practice, for smaller values of v (i.e. closer to 0) larger values of n are needed to accurately 
simulate the SAFVS. It seems possible that, for v closer to 0, the numerics suggested by Figure [5] 
are not a reflection of the true behaviour as n —> oo. 

//www.github.com/nicfreemanl209, 

A On compactness 

In the proof of Lemma |5.8| we used following result, which is almost certainly known but for 
which we were unable to find a reference. 

Lemma A.l Let (./#, dj?) be a metric space and let /C(_#) be the Hausdorff space of compact 
subsets of . Let (Wn)neN be a sequence in KL(.y/L) such that W n —> € JC(^K) as n -+ oo. 

Then W = Woo U(\J nm W n ) is a compact subset of M. 


The C++ code which generated the data displayed in Figure [5] can be obtained from http: 


et al. (2013 
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PROOF: Since .J7 is a metric space, a subset W of ^ is compact if and only if W is sequentially 
compact. Let (rc n ) ne N be any sequence in W and define m n = sup{m G N U { 00 } ; w n G W m }. 
We will construct a convergent subsequence of ('a’n)neN Q W • 

If {m n ; n G N} is finite (as an unordered set) then there exists m G N U { 00 } such that 
w n G W m eventually, in which case ( w n ) n has a convergent subsequence by compactness of 
W m . Alternatively, if {m n ; n G N} is infinite then, with slight abuse of notation, we may pass 
to a subsequence and assume that m n is strictly increasing to 00 . 

By definition of the Hausdorff metric, since W n —> W there exists (/i n )neN Q Woo such that 
d ^( w n , h n ) —> 0 as n —> 00 . Since is compact, (h n )neN has a convergent subsequence, and 
with further slight abuse of notation we pass to this subsequence and assume that h n —>• h G Woo 
as n —>• 00 . Then d^(w n , h) —>• 0 as n —)• 00 , and the proof is complete. ■ 
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